SlideShare a Scribd company logo
1 of 10
Download to read offline
ISSN 2350-1022
International Journal of Recent Research in Mathematics Computer Science and Information Technology
Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org
Page | 79
Paper Publications
Improving Service Recommendation Method
on Map reduce by User Preferences and
Reviews
1
Dayanand Bhovi, 2
Mr.Ashwin Kumar
2
Sr. Assistant professor, 1,2
Dpt. Of Computer Science & Engg. Mangalore Institute of Technology and Engineering
Mangalore, Karnataka, India
Abstract: Service recommender systems have been shown as valuable tools for providing appropriate
recommendations to users. In the last decade, the amount of customers, services and online information has grown
rapidly, yielding the big data analysis problem for service recommender systems. Consequently, traditional service
recommender systems often suffer from scalability and inefficiency problems most of existing service
recommender systems present the same ratings and rankings of services to different users without considering
diverse users' preferences, and therefore fails to meet users' personalized requirements. In this paper, to address
the above challenges and presenting a personalized service recommendation list and recommending the most
appropriate services to the users effectively. Specifically, keywords are used to indicate users' preferences, and a
user-based Collaborative filtering algorithm is adopted to generate appropriate recommendations.
Keywords: recommender system, user preference, keyword, Big Data, mapreduce, Hadoop.
I. INTRODUCTION
In recent years, the amount of data in our world has been increasing explosively, and analyzing large data sets—so-called
“Big Data”— becomes a key basis of competition underpinning new waves of productivity growth, innovation, and
consumer surplus [1]. Then, what is “Big Data”?, Big Data refers to datasets whose size is beyond the ability of current
technology, method and theory to capture, man-age, and process the data within a tolerable elapsed time. Today, Big Data
management stands out as a challenge for IT companies.
The solution to such a challenge is shifting increasingly from providing hardware to provisioning more manageable
software solutions [2]. Big Data also brings new opportunities and critical challenges to industry and academia [3]
[4].Similar to most big data applications, the big data tendency also poses heavy impacts on service recommender
systems. With the growing number of alternative services, effectively recommending services that users preferred has
become an important research issue. Service recommender systems have been shown as valuable tools to help users deal
with services overload and provide appropriate recommendations to them. Examples of such practical applications include
CDs, books, web pages and various other products now use recommender systems [5], [6], [7].
Over the last decade, there has been much research done both in industry and academia on developing new approaches for
service recommender systems [8], [9].
Cloud Computing and Map Reduce:
Cloud computing is a successful paradigm of service oriented computing and has revolutionized the way computing
infrastructure is abstracted and used. The major goal of cloud computing is to share resources, such as infrastructure,
platform, software, and business process [14].
ISSN 2350-1022
International Journal of Recent Research in Mathematics Computer Science and Information Technology
Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org
Page | 80
Paper Publications
Cloud computing can provide effective platforms to facilitate parallel computing, which has gained significant attention in
recent years to process large volume of data. There are several cloud computing tools available, such as Hadoop
(http://hadoop.apache.org/), Mahout (http://mahout.apache.org/), MapReduce of Google [15], the Dynamo of
Amazon.com [16], the Dryad of Microsoft and Neptune of Ask.com [17], etc. Among these tools, Hadoop is the most
popular open source cloud computing platform inspired by Map Reduce and Google File System papers [18], which
supports Map Reduce programming framework and mass data storage with good fault tolerance.
Map Reduce is a popular distributed implementation model proposed by Google, which is inspired by map and reduce
operations in the Lisp programming language. More details about Map Reduce can be found in Appendix A.2. Nowadays,
the trend everything as a service has been creating a Big Services era due to the foundational architecture of services
computing. And servicelization is the way of offering social networking services, big data analytics, and Internet services
[19] [20]. Thus the cloud computing tools aforementioned can be used to improve the scalability and efficiency of service
recommendation methods in the Big Data environment.
II. LITERATURE SURVEY
Cloud computing provides services to internet by utilizing resources of the computing infrastructure to provide different
services of the internet. It allows consumers and businesses to use applications without installation and access their
personal files at any computer with internet access. A distributed storage system for managing structured data at Google
called Bigtable. Bigtable is designed to reliably scale to petabytes of data and thousands of machines. Bigtable has
achieved several goals: wide applicability, scalability, high performance, and high availability. Bigtable is used by more
than sixty Google products and projects, including Google Analytics, Google Finance, Personalized Search, Google Earth
and many more. In this paper a review is done to analyze the cloud performance on data stored at data centers.[1]
In this paper, propose a contextual MAB(Multi-Armed Bandit algorithms) based clustering algorithm to design and
deploy recommender systems, in which both the contexts and the large item space are considered. To improve the
learning speed, we consider partitioning the item cluster tree into a set of clusters. The algorithm alternates between the
exploration and exploitation phases and aggregates the contextual information from a sublinear number of past arrived
contexts.[2]
Leveraging community imparted data for personalized recommendation is one of the active research problems since there
are rich contexts and human activities in such explosively growing data. We focus on personalized travel recommendation
and show promising applications. We conduct personalized travel recommendation by considering specific user profiles
or attributes. We propose a personalized travel recommendation model considering users’ attributes as well as their group
types and the knowledge mined from travel logs .We investigate the association of people attributes such as time, popular
landmarks, etc., We also recommend the nearby location suggestions in mobile using android.[3]
In this paper we present one such class of item-based recommendation algorithms that first determine the similarities
between the various items and then used them to identify the set of items to be recommended. The key steps in this class
of algorithms are (i) the method used to compute the similarity between the items, and (ii) the method used to combine
these similarities in order to compute the similarity between a basket of items and a candidate recommender item.[4]
In this paper, the recommendation techniques applied in Personal Program Guide (PPG). This is a system generating
personalized Electronic Program Guides for DigitalTV. The PPG manages a user model that stores the estimates of the
individual user’spreferences for TV program categories. This model results from the integration of different preference
acquisition modules that handle explicit user preferences, stereotypical information about TV viewers, and information
about the user’s viewing behavior. The observation of the individual viewing behavior is particularly easy because the
PPG runs on the set-top box and isdeeply integrated with the TV playing and the video recording services offered by that
type of device.[5]
III. PROBLEM DEFENITION
Implement Collaborative Filtering algorithm on mapreduce is adopted to generate appropriate recommendations. To
improve the scalability and efficiency of Keyword Service Recommendation Method in Bigdata Environment. Presenting
a personalized service recommendation list and recommending the most appropriate services to the users effectively.
ISSN 2350-1022
International Journal of Recent Research in Mathematics Computer Science and Information Technology
Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org
Page | 81
Paper Publications
IV. METHODOLOGY
In this paper, we propose a keyword-aware service recommendation method, named KASR. In this method, keywords are
used to indicate both of users' preferences and the quality of candidate services. A user-based CF algorithm is adopted to
generate appropriate recommendations. KASR aims at calculating a personalized rating of each candidate service for a
user, and then presenting a personalized service recommendation list and recommending the most appropriate services to
him/her. Moreover, to improve the scalability and efficiency of our recommendation method in “Big Data” environment,
we implement it in a MapReduce framework on Hadoop by splitting the proposed algorithm into multiple MapReduce
phases.
Table 1 summarizes the basic symbols and notations used in this paper.
Table 1 Basic symbols and notations
Symbols Definition
K Keyword-candidate list, K={k1,k2,…..kn}
APK The preference keyword set of the active user
PPK The preference keyword set of the previous user
Sim(APK,PPK) The similarity between APK and PPK
→
WP
A preference weight vector
→
WAP
A preference weight vector of active user
→
WPP
A preference weight vector of previous user
In our method, two data structures keyword-candidate list and specialized domain thesaurus are introduced to help obtain
users' preferences.
The keyword-candidate list is a set of keywords about users’ preferences and multi criteria of the candidate services,
which can be denoted as K={k1,k2,..kn} , n is the number of the keywords in the keyword-candidate list. An example of a
simple keyword-candidate list of the hotel reservation system is described in Table 2. Keywords in the keyword-candidate
list can be a word or multiple words related with the quality criteria of candidate services.
In this paper, the preferences of previous users will be extracted from their reviews for candidate services and formalized
into a keyword set. Usually, since some of words in reviews can not exactly match the corresponding keywords in the
keyword-candidate list which characterize the same aspects as the words. The corresponding key-words should be
extracted as well. In this paper, we assume that specialized domain thesauruses are built to sup-port the keyword
extraction, and different domain thesauruses are built for different service domains.
A domain thesaurus is a reference work of the keyword-candidate list that lists words grouped together according to the
similarity of key-word meaning, including related and contrasting words and antonyms [21] [22].
TABLE 2 Keyword-candidate list of hotel reservation system
No Keyword No Keyword No keyword
1 Service 7 Transportation 13 Airport
2 Rooom 8 Family,friends 14 Wi-fi
3 Shopping 9 Location 15 Environment
4 Cleanliness 10 Quite 16 Bar
5 Food 11 View 17 Beach
6 value 12 Fittness
A Keyword-aware Service Recommendation Method:
The main steps of KASR are depicted in Fig.1, which are described in detail as follows.
ISSN 2350-1022
International Journal of Recent Research in Mathematics Computer Science and Information Technology
Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org
Page | 82
Paper Publications
(1) Capture user preferences by a keyword-aware approach: In this step, the preferences of active users and previous
users are formalized into their corresponding preference keyword sets respectively. In this paper, an active user refers to a
current user needs recommendation.
Fig:1 KASR’s main steps
Preferences of an active user:
An active user can give his/her preferences about candidate services by selecting keywords from a keyword-candidate list,
which reflect the quality criteria of the services he/she is concerned about. The preference keyword set of the active user
can be denoted as APK={ak1,ak2,..akl} where aki (1≤i≤l) is ith
keyword selected from the keyword-candidate list by the
active user, l is the number of selected keywords.
Preferences of previous users:
The preferences of a previous user for a candidate service are extracted from his/her reviews for the service according to
the keyword-candidate list and domain thesaurus. And a review of the previous user will be formalized into the preference
key-word set of him/her, which can be denoted as ,PPK={pk1,pk2,…pkh} where pki (1≤i≤h) is the ith keyword extracted
from the review, h is the number of extracted keywords.
The keyword extraction process is described as follows:
a) Preprocess: Firstly, HTML tags and stop words in the reviews snippet collection should be removed to avoid affecting
the quality of the keyword extraction in the next stage. And the Porter Stemmer algorithm (keyword strip-ping) [23] is
used to remove the commoner morphological and in flexional endings from words in English. Its main use is as part of a
term normalization process that is usually done when setting up Information Retrieval systems [23].
b) Keyword extraction: In this phase, each review will be transformed into a corresponding keyword set according to the
keyword-candidate list and domain thesaurus. If the review contains a word in the domain thesaurus, then the
corresponding keyword should be extracted into the preference keyword set of the user. For example, if a review of a
previous user for a hotel has the word “spa”, which is corresponding to the keyword ``Fitness" in the domain thesaurus,
then the keyword ``Fitness" should be contained in the preference keyword set of the previous user. If a keyword appears
more than once in a review, the times of repetitions will be recorded. In this paper, it is regarded that keywords appearing
multiple times are more important. The times of repetitions will be used to calculate the weight of the keyword in
preference keyword set in the next step.
(2) Similarity computation: The second step is to identify the reviews of previous users who have similar tastes to an
active user by finding neighborhoods of the active user based on the similarity of their preferences. Before similarity
computation, the reviews unrelated to the active user's preferences will be filtered out by the intersection concept in set
ISSN 2350-1022
International Journal of Recent Research in Mathematics Computer Science and Information Technology
Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org
Page | 83
Paper Publications
theory. If the intersection of the preference keyword sets of the active user and a previous user is an empty set, then the
preference keyword set of the previous user will be filtered out.
Two similarity computation methods:
a) Approximate similarity computation:
The approximate similarity computation method is for the case that the weights of the keywords in the preference
keyword set are unavailable.
A frequently used method for comparing the similarity and diversity of sample sets, Jaccard coefficient, is applied in the
approximate similarity computation. Jaccard coefficient is measurement of asymmetric information on binary (and non-
binary) variables, and it is useful when negative values give no information. The similarity between the preferences of the
active user and a previous user based on Jaccard coefficient is described as follows:
|APKᴖPPK|
Sim(APK, PPK)=Jaccard(APK,PPK)=
|APKᴗPPK| (1)
Where APK is the preference keyword set of the active user, PPK is the preference keyword set of a previous user. And
the weight of the keywords is not considered in this approach.
Algorithm 1, SIM-ASC, illustrates the functionality of the approximate similarity computation method.
Algorithm1: SIM-ASC
Input: The preference keyword set of the active
user APK The preference keyword set of
a previous user PPKj
Output: The similarity of APK and
PPKj,simASC(APK PPKj)
|APKᴖPPKj|
1. simASC(APK PPKj)=
|APKᴗPPKj|
2. Return the similarity of APK and PPKj, simASC(APK PPKj)
b) Exact similarity computation:
The exact similarity computation method is for the case that the weight of the keywords are available.
A cosine-based approach is applied in the exact similarity computation, which is similar to the Vector Space Model
(VSM) in information retrieval [24] [25]. In this cosine-based approach, The preference keyword sets of the active user
and previous users will be transformed into n-dimensional weight vectors respectively, namely preference weight vector,
which can be denoted as WP , n is the number of keywords in the keyword-candidate list, is the weight of the keyword ki
in the keyword-candidate list. If the keyword ki is not contained in the preference keyword set, then the weight of ki in the
preference weight vector is 0, i.e., wi=0. The preference weight vectors of the active user and a previous user are noted as
→ and → , respectively.
WAP WPP
In this paper, we use the Analytic Hierarchy Process (AHP) model to decide the weight of the keywords in the preference
keyword set of the active user. AHP method is provided by Saaty in 1970s to choose the best satisfied business role for its
hierarchy nature [26]. The weight computing based on the AHP model is decied as follows:
Firstly, we construct the pair-wise comparison matrix in terms of the relative importance between each two key-words.
The pair-wise comparison matrix Am = (aij) m must satisfy the following properties, aij represents the relative importance
of two keywords:
ISSN 2350-1022
International Journal of Recent Research in Mathematics Computer Science and Information Technology
Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org
Page | 84
Paper Publications
1) aij =1, i=j=1,2,3…………m
2) aij=1/ aij, i=j=1,2,3…………m and i≠j
3) aij= aik/ ajk , i=j=k=1,2,3…….m and i≠j
After checking the consistence of the matrix, then we calculate the weight by the following function:
Wi ∑ ∑ (2)
Where aij is the relative importance between two keywords, m is the number of the keywords in the preference keyword
set of the active user
The weight vector of the preference keyword set of a previous user can be decided by the term frequency/inverse
document frequency (TF-IDF) measure [27], which is one of the best-known measures for specifying the weight of key-
words in Information Retrieval.
In the TF-IDF approach, to calculate the preference weight vector of a previous user u', all reviews by user u' should be
collected. Here, all reviews contain the reviews by user u' for the candidate services and similar ser-vices not in the
candidate services. The reviews should also be transformed into keyword sets respectively according to the keyword-
candidate list and the domain thesaurus.
TF, the term frequency of the keyword pki in the preference keyword set of user u' is defined as
TF =Npki / ∑ 3)
Where is the number of occurrences of the keyword pki in all the keyword sets of the reviews commented by the same
user u', g is the number of the keywords in the preference keyword set of the user u'.
The inverse document frequency (IDF) is obtained by dividing the number of all reviews by the number of reviews
containing the keyword pki.
IDF = log |R' | / |r': pki∈ r'| (4)
Where |R'| is the total number of the reviews commented by user u’, and |r': pki∈ r'| is the number of reviews where
keyword pki appears. So the TF-IDF weight of the keyword pki in the preference keyword set of user u' can be decided by
the following function:
wpki=TF×IDF= (Npki / ∑ ) log |R' | / |r': pki∈ r'|) (5)
Then the similarity based on the cosine-based approach is defined as follows:
→ →
→ → wAP wPP
Sim(APK, PPK)=cos(wAP, wPP) =
→ →
wAP||2 ||wPP||2
→ →
∑
=
→ → (6)
∑ ∑
ISSN 2350-1022
International Journal of Recent Research in Mathematics Computer Science and Information Technology
Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org
Page | 85
Paper Publications
→ →
Where wAP and wPP are respectively the preference weight vectors of the active user and a previous user. Vector wAP,i is
the i-th dimension of vector wAP and represents the weight of the keyword ki in preference keyword set APK,vector wpp,i is
the i-th dimension of vector wpp and represents the weight of the keyword ki in preference keyword set PPK.
Algorithm 2, SIM-ESC, illustrates the functionality of the exact similarity computation method.
Algorithm 2: SIM-ESC (Exact Similarity Computation )
Input: The preference keyword set of the active user APK
The preference keyword set of a previous user PPKj.
Output: The similarity of APK and PPKj,
simESC(APK,PPKj).
1: for each keyword ki in the keyword-candidate list
2: if ki APK then
3: get → by formula(2)
WAP,i
4: else →
WAP,i = 0
5: end if
6: if ki PPKj then
7: get → by formula (5)
Wpp,i
8: else →
WPP,i = 0
9: end if
10: end for
11: get simESC(APK,PPKj).by formula(6)
12: return the similarity of APK and PPKj
simESC(APK,PPKj).
(3) Calculate personalized ratings and generate recom-mendations:
Based on the similarity of the active user and previous us-ers, further filtering will be conducted. Given a threshold δ, if
sim(APK,PPKj)< δ, the preference keyword set of a pre-vious user PPKj will be filtered out, otherwise PPKj will be
retained.
Finally, a personalized service recommendation list will be presented to the user and the ser-vice(s) with the highest
rating(s) will be recommended to him/her.
We use a weighted average approach to calculate the personalized rating pr of a service for the active user.
pr = r + k ∑
k = 1/∑
ISSN 2350-1022
International Journal of Recent Research in Mathematics Computer Science and Information Technology
Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org
Page | 86
Paper Publications
where sim(APK,PPKj) is the similarity of the preference keyword set of the active user and the preference keyword set of
a previous user PPKj; multiplier k serves as a normalizing factor, ˆR( R cap) denotes the set of the remaining preference
keyword sets of previous users after filtering; rj is the rating of the corresponding review of PPKj, and r bar is defined as
the average ratings of the candidate service.
we can calculate the personalized ratings of all candidate services for the active user. Then we can rank the services by the
personalized ratings and present a personalized service recommendation list to him/her. we assume that the services with
higher ratings are more preferable to the user. So the service(s) with the highest rating(s) will be recommended to the
active user. we can recommend the Top-K services to the user.
Algorithm 3: Basic Algorithm of KASR
Input: The preference keyword set of the active user APK
The candidate services WS={ws1,ws2,……wsN}
The threshold δ in the filtering phase The number K .
Output: The services with the Top-K highest ratings
{tws1, tws2, …, twsK}.
1: for each service wsi WS
2: ˆR = Ф sum = 0, r = 0
3: for each review Rj of service wsi
4: process the review into a preference keyword set PPKj
5: if PPKj ᴖ APK = Ф then
6: insert PPKj into ˆR
7: end if
8: end for
9: for each keyword set PPK j ˆR
10:
11: then
12: remove PPKj from ˆR
13: else sum=sum+1, r=r+rj
14: end if
15: end for
16: r = r/sum
17: get pri by formula (7)
18: end for
19: sort the services according to the personalized ratings pri
20: return the services with the Top-K highest ratings
{tws1, tws2… twsk}
Algorithm 3 illustrates the basic algorithm of KASR. The input contains the preference keyword set of the active user
APK, the candidate services WS={ws1,ws2 ,…wsN} the threshold δ in the filtering phase, and the number K. In line 2, ˆR
is used to store the remaining preference keyword sets of previous users, and sum is to record the number of the remaining
ISSN 2350-1022
International Journal of Recent Research in Mathematics Computer Science and Information Technology
Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org
Page | 87
Paper Publications
preference keyword sets of previous users. Line 3 to line 8 is used to process each review of the previous users into the
corresponding preference keyword sets, and then do a simple filtering to filter out the reviews unrelated with the active
user's preferences. Line 9 to line 15 are to calculate the similarity of APK and PPKj, and then filter out the keyword set
PPKj whose similarity with APK is less than the threshold δ.
In this paper, there are two methods to calculate the similarity: approximate similarity computation (see Algorithm 1) and
exact similarity computation (see Algorithm 2). Line 16 to line 18 is to calculate the personalized ratings of the candidate
services for the active user. Finally, line 19 and line 20 is to sort the candidate services according to the personalized
ratings and recommend the services with the Top-K highest ratings to the active user.
V. RESULTS AND DISCUSSION
To verify the scalability of KASR, experiment is conducted respectively in a cluster of nodes ranging from 1 to 8. There
are 4 synthetic datasets used in the experiments (128M, 256M, 512M and 1G datasize). Fig. 2 shows the speedup of
KASR (Here, KASR-ESC method is adopted in the scalability experiment). From Fig. 2, we can see that the speedup of
KASR increases relative linearly with the growth of the number of nodes. Meanwhile, larger dataset obtained a better
speedup. When the datasize is 1G and the number of nodes is 8, the speedup value reaches 6.412, which is 80.15%
(6.412/8=80.15%) of the idealspeedup. The experimental result shows that KASR on Map-Reduce in Hadoop platform
has good scalability over Big Data and performs better with larger dataset.
Fig. 2 Speedup of KASR
Overall, these experimental results show that KASR per-forms well in accuracy and KASR on Mapreduce framework has
good scalability in Big Data environment.
VI. CONCLUSION
In this paper presenting a personalized service recommendation list and recommending the most appropriate service(s) to
the users. Moreover, to improve the scalability and efficiency of KASR in Big Data environment, we have implemented it
on a MapReduce framework in Hadoop platform. Finally, the experimental results demonstrate that KASR significantly
improves the accuracy and scalability of service recommender systems over existing approaches.
ISSN 2350-1022
International Journal of Recent Research in Mathematics Computer Science and Information Technology
Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org
Page | 88
Paper Publications
VII. FUTURE SCOPE
Implement Collaborative filtering algorithm on mapreduce is adopted to generate appropriate recommendations. To
improve the scalability and efficiency of Keyword Service Recommendation Method in Bigdata Environment,to
distinguish the positive and negative preferences of the users from their reviews to make the predictions more accurate.
REFERENCES
[1] Key Word Aware Service Recommendaion Method.
[2] C. Lynch, “Big Data: How do your data grow?” Nature, Vol. 455, No. 7209, pp. 28-29, 2008..
[3] F. Chang, J. Dean, S. Ghemawat, and W. C. Hsieh, “Bigtable: A distributed storage system for structured data,”
ACM Transactions on Computer Systems,Vol. 26, No. 2 (4) , 2008.
[4] W. Dou, X. Zhang, J. Liu, J. Chen, “HireSome-II: Towards Privacy-Aware Cross-Cloud Service Composition for
Big Data Applica-tions,” IEEE Transactions on Parallel and Distributed Systems, 2013.
[5] G. Linden, B. Smith, and J. York, “Amazon.com Recommendations: Item-to-Item Collaborative Filtering,” IEEE
Internet Computing, Vol. 7, No.1, pp. 76-80, 2003.
[6] M. Bjelica, “Towards TV Recommender System Experiments with User Modeling,” IEEE Transactions on
Consumer Electronics, Vol. 56, No.3, pp. 1763-1769, 2010.
[7] M. Alduan, F. Alvarez, J. Menendez, and O. Baez, “Recommender System for Sport Videos Based on User
Audiovisual Consumption,” IEEE Transactions on Multimedia, Vol. 14, No.6, pp. 1546-1557, 2013.
[8] Y. Chen, A. Cheng and W. Hsu, “Travel Recommendation by Mining People Attributes and Travel Group Types
From Community-Contributed Photos”. IEEE Transactions on Multimedia, Vol. 25, No.6, pp. 1283-1295,
2012.www.ijecse.org.
Author’s Profile:
Dayanand Bhovi completed the bachelors degree in Informaton Science &Engineering from University
of VTU and presently pursuing Masters in Engineering in Computer Science & Engineering at Mangalore
Institute of Technology, Mangalore.
Ashwin kumar completed bachelors and masters degree in Computer Science and Engineering. Currently
working as Sr. Assistant professor in Mangalore Institute Technology, Mangalore.

More Related Content

What's hot

A Result on Novel Approach for Load Balancing in Cloud Computing
A Result on Novel Approach for Load Balancing in Cloud ComputingA Result on Novel Approach for Load Balancing in Cloud Computing
A Result on Novel Approach for Load Balancing in Cloud Computingijtsrd
 
A Generic Model for Student Data Analytic Web Service (SDAWS)
A Generic Model for Student Data Analytic Web Service (SDAWS)A Generic Model for Student Data Analytic Web Service (SDAWS)
A Generic Model for Student Data Analytic Web Service (SDAWS)Editor IJCATR
 
The NIH Data Commons - BD2K All Hands Meeting 2015
The NIH Data Commons -  BD2K All Hands Meeting 2015The NIH Data Commons -  BD2K All Hands Meeting 2015
The NIH Data Commons - BD2K All Hands Meeting 2015Vivien Bonazzi
 
BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands Vivien Bonazzi
 
Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...
Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...
Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...IJCSIS Research Publications
 
IRJET- A Scenario on Big Data
IRJET- A Scenario on Big DataIRJET- A Scenario on Big Data
IRJET- A Scenario on Big DataIRJET Journal
 
On Evaluating and Publishing Data Concerns for Data as a Service
On Evaluating and Publishing Data Concerns for Data as a ServiceOn Evaluating and Publishing Data Concerns for Data as a Service
On Evaluating and Publishing Data Concerns for Data as a ServiceHong-Linh Truong
 
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET Journal
 
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...IRJET Journal
 
big-data-analytics-and-iot-in-logistics-a-case-study-2018.pdf
big-data-analytics-and-iot-in-logistics-a-case-study-2018.pdfbig-data-analytics-and-iot-in-logistics-a-case-study-2018.pdf
big-data-analytics-and-iot-in-logistics-a-case-study-2018.pdfAkuhuruf
 
An efficient resource sharing technique for multi-tenant databases
An efficient resource sharing technique for multi-tenant databases An efficient resource sharing technique for multi-tenant databases
An efficient resource sharing technique for multi-tenant databases IJECEIAES
 
A REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATA
A REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATAA REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATA
A REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATAIJMIT JOURNAL
 
Delivering IT as A Utility- A Systematic Review
Delivering IT as A Utility- A Systematic ReviewDelivering IT as A Utility- A Systematic Review
Delivering IT as A Utility- A Systematic Reviewijfcstjournal
 
Big dataimplementation hadoop_and_beyond
Big dataimplementation hadoop_and_beyondBig dataimplementation hadoop_and_beyond
Big dataimplementation hadoop_and_beyondPatrick Bouillaud
 
Effective Cross-Domain Collaborative Filtering using Temporal Domain – A Brie...
Effective Cross-Domain Collaborative Filtering using Temporal Domain – A Brie...Effective Cross-Domain Collaborative Filtering using Temporal Domain – A Brie...
Effective Cross-Domain Collaborative Filtering using Temporal Domain – A Brie...IJMTST Journal
 
Information economics and big data
Information economics and big dataInformation economics and big data
Information economics and big dataMark Albala
 
NIH Data Commons - Note: Presentation has animations
NIH Data Commons  - Note:  Presentation has animations NIH Data Commons  - Note:  Presentation has animations
NIH Data Commons - Note: Presentation has animations Vivien Bonazzi
 

What's hot (20)

A Result on Novel Approach for Load Balancing in Cloud Computing
A Result on Novel Approach for Load Balancing in Cloud ComputingA Result on Novel Approach for Load Balancing in Cloud Computing
A Result on Novel Approach for Load Balancing in Cloud Computing
 
A Generic Model for Student Data Analytic Web Service (SDAWS)
A Generic Model for Student Data Analytic Web Service (SDAWS)A Generic Model for Student Data Analytic Web Service (SDAWS)
A Generic Model for Student Data Analytic Web Service (SDAWS)
 
The NIH Data Commons - BD2K All Hands Meeting 2015
The NIH Data Commons -  BD2K All Hands Meeting 2015The NIH Data Commons -  BD2K All Hands Meeting 2015
The NIH Data Commons - BD2K All Hands Meeting 2015
 
BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands
 
Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...
Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...
Optimizing Bigdata Processing by using Hybrid Hierarchically Distributed Data...
 
IRJET- A Scenario on Big Data
IRJET- A Scenario on Big DataIRJET- A Scenario on Big Data
IRJET- A Scenario on Big Data
 
On Evaluating and Publishing Data Concerns for Data as a Service
On Evaluating and Publishing Data Concerns for Data as a ServiceOn Evaluating and Publishing Data Concerns for Data as a Service
On Evaluating and Publishing Data Concerns for Data as a Service
 
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
 
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
 
big-data-analytics-and-iot-in-logistics-a-case-study-2018.pdf
big-data-analytics-and-iot-in-logistics-a-case-study-2018.pdfbig-data-analytics-and-iot-in-logistics-a-case-study-2018.pdf
big-data-analytics-and-iot-in-logistics-a-case-study-2018.pdf
 
R180305120123
R180305120123R180305120123
R180305120123
 
An efficient resource sharing technique for multi-tenant databases
An efficient resource sharing technique for multi-tenant databases An efficient resource sharing technique for multi-tenant databases
An efficient resource sharing technique for multi-tenant databases
 
A REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATA
A REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATAA REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATA
A REVIEW ON CLASSIFICATION OF DATA IMBALANCE USING BIGDATA
 
Delivering IT as A Utility- A Systematic Review
Delivering IT as A Utility- A Systematic ReviewDelivering IT as A Utility- A Systematic Review
Delivering IT as A Utility- A Systematic Review
 
Big dataimplementation hadoop_and_beyond
Big dataimplementation hadoop_and_beyondBig dataimplementation hadoop_and_beyond
Big dataimplementation hadoop_and_beyond
 
Ck34520526
Ck34520526Ck34520526
Ck34520526
 
Effective Cross-Domain Collaborative Filtering using Temporal Domain – A Brie...
Effective Cross-Domain Collaborative Filtering using Temporal Domain – A Brie...Effective Cross-Domain Collaborative Filtering using Temporal Domain – A Brie...
Effective Cross-Domain Collaborative Filtering using Temporal Domain – A Brie...
 
Information economics and big data
Information economics and big dataInformation economics and big data
Information economics and big data
 
NIH Data Commons - Note: Presentation has animations
NIH Data Commons  - Note:  Presentation has animations NIH Data Commons  - Note:  Presentation has animations
NIH Data Commons - Note: Presentation has animations
 
My Research Interests
My Research InterestsMy Research Interests
My Research Interests
 

Viewers also liked

Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...
Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...
Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...paperpublications3
 
Review Paper on an Open Source Content Management System: Joomla CMS
Review Paper on an Open Source Content Management System: Joomla CMSReview Paper on an Open Source Content Management System: Joomla CMS
Review Paper on an Open Source Content Management System: Joomla CMSpaperpublications3
 
MUTATION AND CROSSOVER ISSUES FOR OSN PRIVACY
MUTATION AND CROSSOVER ISSUES FOR OSN PRIVACYMUTATION AND CROSSOVER ISSUES FOR OSN PRIVACY
MUTATION AND CROSSOVER ISSUES FOR OSN PRIVACYpaperpublications3
 
корисні інструменти для проектної роботи
корисні інструменти для проектної роботикорисні інструменти для проектної роботи
корисні інструменти для проектної роботиYuliya Troyan
 
English reader's vocabulary the initial
English reader's vocabulary the initialEnglish reader's vocabulary the initial
English reader's vocabulary the initialCelia Koutrafouri
 
Dialecticvs nvncivs, postmarxismo, post marxismo, post marxismo, post-marxism...
Dialecticvs nvncivs, postmarxismo, post marxismo, post marxismo, post-marxism...Dialecticvs nvncivs, postmarxismo, post marxismo, post marxismo, post-marxism...
Dialecticvs nvncivs, postmarxismo, post marxismo, post marxismo, post-marxism...UNIVERSITY OF COIMBRA
 
Eugene Hanes Resume *
Eugene Hanes Resume *Eugene Hanes Resume *
Eugene Hanes Resume *Eugene Hanes
 
Why is the world green-draft3
Why is the world green-draft3Why is the world green-draft3
Why is the world green-draft3Robert Gilson
 
Liberty university hius 221 module week 6 mindtap activities complete solutio...
Liberty university hius 221 module week 6 mindtap activities complete solutio...Liberty university hius 221 module week 6 mindtap activities complete solutio...
Liberty university hius 221 module week 6 mindtap activities complete solutio...Kelley King
 
Identificacion estilos de aprendizaje
Identificacion estilos de aprendizajeIdentificacion estilos de aprendizaje
Identificacion estilos de aprendizajeIvonnLopez53
 
2011 argyle-technical-manual
2011 argyle-technical-manual2011 argyle-technical-manual
2011 argyle-technical-manualMaxim Bazium
 
Trabajo Tenerife-Marte. Jaime Landa. 1ºC
Trabajo Tenerife-Marte. Jaime Landa. 1ºCTrabajo Tenerife-Marte. Jaime Landa. 1ºC
Trabajo Tenerife-Marte. Jaime Landa. 1ºCsteve rogers
 
Building Communities that Cares through Volunteerism
Building Communities that Cares through VolunteerismBuilding Communities that Cares through Volunteerism
Building Communities that Cares through Volunteerismzacharia mhuruyengwe
 
Технологія приготування бутербродів
Технологія приготування бутербродівТехнологія приготування бутербродів
Технологія приготування бутербродівTanya Krasko
 

Viewers also liked (20)

Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...
Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...
Implementing Map Reduce Based Edmonds-Karp Algorithm to Determine Maximum Flo...
 
Review Paper on an Open Source Content Management System: Joomla CMS
Review Paper on an Open Source Content Management System: Joomla CMSReview Paper on an Open Source Content Management System: Joomla CMS
Review Paper on an Open Source Content Management System: Joomla CMS
 
Aflatoxicosis in Poultry
Aflatoxicosis in PoultryAflatoxicosis in Poultry
Aflatoxicosis in Poultry
 
MUTATION AND CROSSOVER ISSUES FOR OSN PRIVACY
MUTATION AND CROSSOVER ISSUES FOR OSN PRIVACYMUTATION AND CROSSOVER ISSUES FOR OSN PRIVACY
MUTATION AND CROSSOVER ISSUES FOR OSN PRIVACY
 
корисні інструменти для проектної роботи
корисні інструменти для проектної роботикорисні інструменти для проектної роботи
корисні інструменти для проектної роботи
 
CV2017COMMKTAI_sp_eng
CV2017COMMKTAI_sp_engCV2017COMMKTAI_sp_eng
CV2017COMMKTAI_sp_eng
 
Matéria locadores de equipamentos
Matéria locadores de equipamentosMatéria locadores de equipamentos
Matéria locadores de equipamentos
 
P11 PHP 20017 SIGNE
P11 PHP 20017 SIGNEP11 PHP 20017 SIGNE
P11 PHP 20017 SIGNE
 
English reader's vocabulary the initial
English reader's vocabulary the initialEnglish reader's vocabulary the initial
English reader's vocabulary the initial
 
Dialecticvs nvncivs, postmarxismo, post marxismo, post marxismo, post-marxism...
Dialecticvs nvncivs, postmarxismo, post marxismo, post marxismo, post-marxism...Dialecticvs nvncivs, postmarxismo, post marxismo, post marxismo, post-marxism...
Dialecticvs nvncivs, postmarxismo, post marxismo, post marxismo, post-marxism...
 
Eugene Hanes Resume *
Eugene Hanes Resume *Eugene Hanes Resume *
Eugene Hanes Resume *
 
Why is the world green-draft3
Why is the world green-draft3Why is the world green-draft3
Why is the world green-draft3
 
Liberty university hius 221 module week 6 mindtap activities complete solutio...
Liberty university hius 221 module week 6 mindtap activities complete solutio...Liberty university hius 221 module week 6 mindtap activities complete solutio...
Liberty university hius 221 module week 6 mindtap activities complete solutio...
 
Identificacion estilos de aprendizaje
Identificacion estilos de aprendizajeIdentificacion estilos de aprendizaje
Identificacion estilos de aprendizaje
 
2011 argyle-technical-manual
2011 argyle-technical-manual2011 argyle-technical-manual
2011 argyle-technical-manual
 
Gabriel maldonado pumarejo
Gabriel maldonado pumarejoGabriel maldonado pumarejo
Gabriel maldonado pumarejo
 
Trabajo Tenerife-Marte. Jaime Landa. 1ºC
Trabajo Tenerife-Marte. Jaime Landa. 1ºCTrabajo Tenerife-Marte. Jaime Landa. 1ºC
Trabajo Tenerife-Marte. Jaime Landa. 1ºC
 
Building Communities that Cares through Volunteerism
Building Communities that Cares through VolunteerismBuilding Communities that Cares through Volunteerism
Building Communities that Cares through Volunteerism
 
Технологія приготування бутербродів
Технологія приготування бутербродівТехнологія приготування бутербродів
Технологія приготування бутербродів
 
El libro
El libroEl libro
El libro
 

Similar to Improving Service Recommendation Method on Map reduce by User Preferences and Reviews

Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsProjection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsIRJET Journal
 
IRJET- An Integrated Recommendation System using Graph Database and QGIS
IRJET-  	  An Integrated Recommendation System using Graph Database and QGISIRJET-  	  An Integrated Recommendation System using Graph Database and QGIS
IRJET- An Integrated Recommendation System using Graph Database and QGISIRJET Journal
 
Recommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduceRecommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduceIJDKP
 
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...IRJET Journal
 
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...IJTET Journal
 
IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...
IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...
IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...IRJET Journal
 
IRJET- Implementing Social CRM System for an Online Grocery Shopping Platform...
IRJET- Implementing Social CRM System for an Online Grocery Shopping Platform...IRJET- Implementing Social CRM System for an Online Grocery Shopping Platform...
IRJET- Implementing Social CRM System for an Online Grocery Shopping Platform...IRJET Journal
 
Big Data Analytics in the Cloud for Business Intelligence.docx
Big Data Analytics in the Cloud for Business Intelligence.docxBig Data Analytics in the Cloud for Business Intelligence.docx
Big Data Analytics in the Cloud for Business Intelligence.docxVENKATAAVINASH10
 
Survey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning AlgorithmsSurvey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning AlgorithmsIRJET Journal
 
Recommender systems: a novel approach based on singular value decomposition
Recommender systems: a novel approach based on singular  value decompositionRecommender systems: a novel approach based on singular  value decomposition
Recommender systems: a novel approach based on singular value decompositionIJECEIAES
 
Machine learning based recommender system for e-commerce
Machine learning based recommender system for e-commerceMachine learning based recommender system for e-commerce
Machine learning based recommender system for e-commerceIAESIJAI
 
Big data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docxBig data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docxhartrobert670
 
Providing Highly Accurate Service Recommendation over Big Data using Adaptive...
Providing Highly Accurate Service Recommendation over Big Data using Adaptive...Providing Highly Accurate Service Recommendation over Big Data using Adaptive...
Providing Highly Accurate Service Recommendation over Big Data using Adaptive...IRJET Journal
 
User Preferences Based Recommendation System for Services using Mapreduce App...
User Preferences Based Recommendation System for Services using Mapreduce App...User Preferences Based Recommendation System for Services using Mapreduce App...
User Preferences Based Recommendation System for Services using Mapreduce App...IJMTST Journal
 
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...IRJET Journal
 
A Survey on Recommendation System based on Knowledge Graph and Machine Learning
A Survey on Recommendation System based on Knowledge Graph and Machine LearningA Survey on Recommendation System based on Knowledge Graph and Machine Learning
A Survey on Recommendation System based on Knowledge Graph and Machine LearningIRJET Journal
 
Time-Ordered Collaborative Filtering for News Recommendation
Time-Ordered Collaborative Filtering for News RecommendationTime-Ordered Collaborative Filtering for News Recommendation
Time-Ordered Collaborative Filtering for News RecommendationIRJET Journal
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 
Full Paper: Analytics: Key to go from generating big data to deriving busines...
Full Paper: Analytics: Key to go from generating big data to deriving busines...Full Paper: Analytics: Key to go from generating big data to deriving busines...
Full Paper: Analytics: Key to go from generating big data to deriving busines...Piyush Malik
 
Safe Drinking Water In Bangladesh Essay
Safe Drinking Water In Bangladesh EssaySafe Drinking Water In Bangladesh Essay
Safe Drinking Water In Bangladesh EssaySusan Cox
 

Similar to Improving Service Recommendation Method on Map reduce by User Preferences and Reviews (20)

Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsProjection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
 
IRJET- An Integrated Recommendation System using Graph Database and QGIS
IRJET-  	  An Integrated Recommendation System using Graph Database and QGISIRJET-  	  An Integrated Recommendation System using Graph Database and QGIS
IRJET- An Integrated Recommendation System using Graph Database and QGIS
 
Recommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduceRecommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduce
 
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
 
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
An Improvised Fuzzy Preference Tree Of CRS For E-Services Using Incremental A...
 
IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...
IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...
IRJET- Strength and Workability of High Volume Fly Ash Self-Compacting Concre...
 
IRJET- Implementing Social CRM System for an Online Grocery Shopping Platform...
IRJET- Implementing Social CRM System for an Online Grocery Shopping Platform...IRJET- Implementing Social CRM System for an Online Grocery Shopping Platform...
IRJET- Implementing Social CRM System for an Online Grocery Shopping Platform...
 
Big Data Analytics in the Cloud for Business Intelligence.docx
Big Data Analytics in the Cloud for Business Intelligence.docxBig Data Analytics in the Cloud for Business Intelligence.docx
Big Data Analytics in the Cloud for Business Intelligence.docx
 
Survey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning AlgorithmsSurvey on MapReduce in Big Data Clustering using Machine Learning Algorithms
Survey on MapReduce in Big Data Clustering using Machine Learning Algorithms
 
Recommender systems: a novel approach based on singular value decomposition
Recommender systems: a novel approach based on singular  value decompositionRecommender systems: a novel approach based on singular  value decomposition
Recommender systems: a novel approach based on singular value decomposition
 
Machine learning based recommender system for e-commerce
Machine learning based recommender system for e-commerceMachine learning based recommender system for e-commerce
Machine learning based recommender system for e-commerce
 
Big data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docxBig data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docx
 
Providing Highly Accurate Service Recommendation over Big Data using Adaptive...
Providing Highly Accurate Service Recommendation over Big Data using Adaptive...Providing Highly Accurate Service Recommendation over Big Data using Adaptive...
Providing Highly Accurate Service Recommendation over Big Data using Adaptive...
 
User Preferences Based Recommendation System for Services using Mapreduce App...
User Preferences Based Recommendation System for Services using Mapreduce App...User Preferences Based Recommendation System for Services using Mapreduce App...
User Preferences Based Recommendation System for Services using Mapreduce App...
 
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
 
A Survey on Recommendation System based on Knowledge Graph and Machine Learning
A Survey on Recommendation System based on Knowledge Graph and Machine LearningA Survey on Recommendation System based on Knowledge Graph and Machine Learning
A Survey on Recommendation System based on Knowledge Graph and Machine Learning
 
Time-Ordered Collaborative Filtering for News Recommendation
Time-Ordered Collaborative Filtering for News RecommendationTime-Ordered Collaborative Filtering for News Recommendation
Time-Ordered Collaborative Filtering for News Recommendation
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Full Paper: Analytics: Key to go from generating big data to deriving busines...
Full Paper: Analytics: Key to go from generating big data to deriving busines...Full Paper: Analytics: Key to go from generating big data to deriving busines...
Full Paper: Analytics: Key to go from generating big data to deriving busines...
 
Safe Drinking Water In Bangladesh Essay
Safe Drinking Water In Bangladesh EssaySafe Drinking Water In Bangladesh Essay
Safe Drinking Water In Bangladesh Essay
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightSafe Software
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....rightmanforbloodline
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformWSO2
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfdanishmna97
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAnitaRaj43
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data SciencePaolo Missier
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 

Recently uploaded (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 

Improving Service Recommendation Method on Map reduce by User Preferences and Reviews

  • 1. ISSN 2350-1022 International Journal of Recent Research in Mathematics Computer Science and Information Technology Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org Page | 79 Paper Publications Improving Service Recommendation Method on Map reduce by User Preferences and Reviews 1 Dayanand Bhovi, 2 Mr.Ashwin Kumar 2 Sr. Assistant professor, 1,2 Dpt. Of Computer Science & Engg. Mangalore Institute of Technology and Engineering Mangalore, Karnataka, India Abstract: Service recommender systems have been shown as valuable tools for providing appropriate recommendations to users. In the last decade, the amount of customers, services and online information has grown rapidly, yielding the big data analysis problem for service recommender systems. Consequently, traditional service recommender systems often suffer from scalability and inefficiency problems most of existing service recommender systems present the same ratings and rankings of services to different users without considering diverse users' preferences, and therefore fails to meet users' personalized requirements. In this paper, to address the above challenges and presenting a personalized service recommendation list and recommending the most appropriate services to the users effectively. Specifically, keywords are used to indicate users' preferences, and a user-based Collaborative filtering algorithm is adopted to generate appropriate recommendations. Keywords: recommender system, user preference, keyword, Big Data, mapreduce, Hadoop. I. INTRODUCTION In recent years, the amount of data in our world has been increasing explosively, and analyzing large data sets—so-called “Big Data”— becomes a key basis of competition underpinning new waves of productivity growth, innovation, and consumer surplus [1]. Then, what is “Big Data”?, Big Data refers to datasets whose size is beyond the ability of current technology, method and theory to capture, man-age, and process the data within a tolerable elapsed time. Today, Big Data management stands out as a challenge for IT companies. The solution to such a challenge is shifting increasingly from providing hardware to provisioning more manageable software solutions [2]. Big Data also brings new opportunities and critical challenges to industry and academia [3] [4].Similar to most big data applications, the big data tendency also poses heavy impacts on service recommender systems. With the growing number of alternative services, effectively recommending services that users preferred has become an important research issue. Service recommender systems have been shown as valuable tools to help users deal with services overload and provide appropriate recommendations to them. Examples of such practical applications include CDs, books, web pages and various other products now use recommender systems [5], [6], [7]. Over the last decade, there has been much research done both in industry and academia on developing new approaches for service recommender systems [8], [9]. Cloud Computing and Map Reduce: Cloud computing is a successful paradigm of service oriented computing and has revolutionized the way computing infrastructure is abstracted and used. The major goal of cloud computing is to share resources, such as infrastructure, platform, software, and business process [14].
  • 2. ISSN 2350-1022 International Journal of Recent Research in Mathematics Computer Science and Information Technology Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org Page | 80 Paper Publications Cloud computing can provide effective platforms to facilitate parallel computing, which has gained significant attention in recent years to process large volume of data. There are several cloud computing tools available, such as Hadoop (http://hadoop.apache.org/), Mahout (http://mahout.apache.org/), MapReduce of Google [15], the Dynamo of Amazon.com [16], the Dryad of Microsoft and Neptune of Ask.com [17], etc. Among these tools, Hadoop is the most popular open source cloud computing platform inspired by Map Reduce and Google File System papers [18], which supports Map Reduce programming framework and mass data storage with good fault tolerance. Map Reduce is a popular distributed implementation model proposed by Google, which is inspired by map and reduce operations in the Lisp programming language. More details about Map Reduce can be found in Appendix A.2. Nowadays, the trend everything as a service has been creating a Big Services era due to the foundational architecture of services computing. And servicelization is the way of offering social networking services, big data analytics, and Internet services [19] [20]. Thus the cloud computing tools aforementioned can be used to improve the scalability and efficiency of service recommendation methods in the Big Data environment. II. LITERATURE SURVEY Cloud computing provides services to internet by utilizing resources of the computing infrastructure to provide different services of the internet. It allows consumers and businesses to use applications without installation and access their personal files at any computer with internet access. A distributed storage system for managing structured data at Google called Bigtable. Bigtable is designed to reliably scale to petabytes of data and thousands of machines. Bigtable has achieved several goals: wide applicability, scalability, high performance, and high availability. Bigtable is used by more than sixty Google products and projects, including Google Analytics, Google Finance, Personalized Search, Google Earth and many more. In this paper a review is done to analyze the cloud performance on data stored at data centers.[1] In this paper, propose a contextual MAB(Multi-Armed Bandit algorithms) based clustering algorithm to design and deploy recommender systems, in which both the contexts and the large item space are considered. To improve the learning speed, we consider partitioning the item cluster tree into a set of clusters. The algorithm alternates between the exploration and exploitation phases and aggregates the contextual information from a sublinear number of past arrived contexts.[2] Leveraging community imparted data for personalized recommendation is one of the active research problems since there are rich contexts and human activities in such explosively growing data. We focus on personalized travel recommendation and show promising applications. We conduct personalized travel recommendation by considering specific user profiles or attributes. We propose a personalized travel recommendation model considering users’ attributes as well as their group types and the knowledge mined from travel logs .We investigate the association of people attributes such as time, popular landmarks, etc., We also recommend the nearby location suggestions in mobile using android.[3] In this paper we present one such class of item-based recommendation algorithms that first determine the similarities between the various items and then used them to identify the set of items to be recommended. The key steps in this class of algorithms are (i) the method used to compute the similarity between the items, and (ii) the method used to combine these similarities in order to compute the similarity between a basket of items and a candidate recommender item.[4] In this paper, the recommendation techniques applied in Personal Program Guide (PPG). This is a system generating personalized Electronic Program Guides for DigitalTV. The PPG manages a user model that stores the estimates of the individual user’spreferences for TV program categories. This model results from the integration of different preference acquisition modules that handle explicit user preferences, stereotypical information about TV viewers, and information about the user’s viewing behavior. The observation of the individual viewing behavior is particularly easy because the PPG runs on the set-top box and isdeeply integrated with the TV playing and the video recording services offered by that type of device.[5] III. PROBLEM DEFENITION Implement Collaborative Filtering algorithm on mapreduce is adopted to generate appropriate recommendations. To improve the scalability and efficiency of Keyword Service Recommendation Method in Bigdata Environment. Presenting a personalized service recommendation list and recommending the most appropriate services to the users effectively.
  • 3. ISSN 2350-1022 International Journal of Recent Research in Mathematics Computer Science and Information Technology Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org Page | 81 Paper Publications IV. METHODOLOGY In this paper, we propose a keyword-aware service recommendation method, named KASR. In this method, keywords are used to indicate both of users' preferences and the quality of candidate services. A user-based CF algorithm is adopted to generate appropriate recommendations. KASR aims at calculating a personalized rating of each candidate service for a user, and then presenting a personalized service recommendation list and recommending the most appropriate services to him/her. Moreover, to improve the scalability and efficiency of our recommendation method in “Big Data” environment, we implement it in a MapReduce framework on Hadoop by splitting the proposed algorithm into multiple MapReduce phases. Table 1 summarizes the basic symbols and notations used in this paper. Table 1 Basic symbols and notations Symbols Definition K Keyword-candidate list, K={k1,k2,…..kn} APK The preference keyword set of the active user PPK The preference keyword set of the previous user Sim(APK,PPK) The similarity between APK and PPK → WP A preference weight vector → WAP A preference weight vector of active user → WPP A preference weight vector of previous user In our method, two data structures keyword-candidate list and specialized domain thesaurus are introduced to help obtain users' preferences. The keyword-candidate list is a set of keywords about users’ preferences and multi criteria of the candidate services, which can be denoted as K={k1,k2,..kn} , n is the number of the keywords in the keyword-candidate list. An example of a simple keyword-candidate list of the hotel reservation system is described in Table 2. Keywords in the keyword-candidate list can be a word or multiple words related with the quality criteria of candidate services. In this paper, the preferences of previous users will be extracted from their reviews for candidate services and formalized into a keyword set. Usually, since some of words in reviews can not exactly match the corresponding keywords in the keyword-candidate list which characterize the same aspects as the words. The corresponding key-words should be extracted as well. In this paper, we assume that specialized domain thesauruses are built to sup-port the keyword extraction, and different domain thesauruses are built for different service domains. A domain thesaurus is a reference work of the keyword-candidate list that lists words grouped together according to the similarity of key-word meaning, including related and contrasting words and antonyms [21] [22]. TABLE 2 Keyword-candidate list of hotel reservation system No Keyword No Keyword No keyword 1 Service 7 Transportation 13 Airport 2 Rooom 8 Family,friends 14 Wi-fi 3 Shopping 9 Location 15 Environment 4 Cleanliness 10 Quite 16 Bar 5 Food 11 View 17 Beach 6 value 12 Fittness A Keyword-aware Service Recommendation Method: The main steps of KASR are depicted in Fig.1, which are described in detail as follows.
  • 4. ISSN 2350-1022 International Journal of Recent Research in Mathematics Computer Science and Information Technology Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org Page | 82 Paper Publications (1) Capture user preferences by a keyword-aware approach: In this step, the preferences of active users and previous users are formalized into their corresponding preference keyword sets respectively. In this paper, an active user refers to a current user needs recommendation. Fig:1 KASR’s main steps Preferences of an active user: An active user can give his/her preferences about candidate services by selecting keywords from a keyword-candidate list, which reflect the quality criteria of the services he/she is concerned about. The preference keyword set of the active user can be denoted as APK={ak1,ak2,..akl} where aki (1≤i≤l) is ith keyword selected from the keyword-candidate list by the active user, l is the number of selected keywords. Preferences of previous users: The preferences of a previous user for a candidate service are extracted from his/her reviews for the service according to the keyword-candidate list and domain thesaurus. And a review of the previous user will be formalized into the preference key-word set of him/her, which can be denoted as ,PPK={pk1,pk2,…pkh} where pki (1≤i≤h) is the ith keyword extracted from the review, h is the number of extracted keywords. The keyword extraction process is described as follows: a) Preprocess: Firstly, HTML tags and stop words in the reviews snippet collection should be removed to avoid affecting the quality of the keyword extraction in the next stage. And the Porter Stemmer algorithm (keyword strip-ping) [23] is used to remove the commoner morphological and in flexional endings from words in English. Its main use is as part of a term normalization process that is usually done when setting up Information Retrieval systems [23]. b) Keyword extraction: In this phase, each review will be transformed into a corresponding keyword set according to the keyword-candidate list and domain thesaurus. If the review contains a word in the domain thesaurus, then the corresponding keyword should be extracted into the preference keyword set of the user. For example, if a review of a previous user for a hotel has the word “spa”, which is corresponding to the keyword ``Fitness" in the domain thesaurus, then the keyword ``Fitness" should be contained in the preference keyword set of the previous user. If a keyword appears more than once in a review, the times of repetitions will be recorded. In this paper, it is regarded that keywords appearing multiple times are more important. The times of repetitions will be used to calculate the weight of the keyword in preference keyword set in the next step. (2) Similarity computation: The second step is to identify the reviews of previous users who have similar tastes to an active user by finding neighborhoods of the active user based on the similarity of their preferences. Before similarity computation, the reviews unrelated to the active user's preferences will be filtered out by the intersection concept in set
  • 5. ISSN 2350-1022 International Journal of Recent Research in Mathematics Computer Science and Information Technology Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org Page | 83 Paper Publications theory. If the intersection of the preference keyword sets of the active user and a previous user is an empty set, then the preference keyword set of the previous user will be filtered out. Two similarity computation methods: a) Approximate similarity computation: The approximate similarity computation method is for the case that the weights of the keywords in the preference keyword set are unavailable. A frequently used method for comparing the similarity and diversity of sample sets, Jaccard coefficient, is applied in the approximate similarity computation. Jaccard coefficient is measurement of asymmetric information on binary (and non- binary) variables, and it is useful when negative values give no information. The similarity between the preferences of the active user and a previous user based on Jaccard coefficient is described as follows: |APKᴖPPK| Sim(APK, PPK)=Jaccard(APK,PPK)= |APKᴗPPK| (1) Where APK is the preference keyword set of the active user, PPK is the preference keyword set of a previous user. And the weight of the keywords is not considered in this approach. Algorithm 1, SIM-ASC, illustrates the functionality of the approximate similarity computation method. Algorithm1: SIM-ASC Input: The preference keyword set of the active user APK The preference keyword set of a previous user PPKj Output: The similarity of APK and PPKj,simASC(APK PPKj) |APKᴖPPKj| 1. simASC(APK PPKj)= |APKᴗPPKj| 2. Return the similarity of APK and PPKj, simASC(APK PPKj) b) Exact similarity computation: The exact similarity computation method is for the case that the weight of the keywords are available. A cosine-based approach is applied in the exact similarity computation, which is similar to the Vector Space Model (VSM) in information retrieval [24] [25]. In this cosine-based approach, The preference keyword sets of the active user and previous users will be transformed into n-dimensional weight vectors respectively, namely preference weight vector, which can be denoted as WP , n is the number of keywords in the keyword-candidate list, is the weight of the keyword ki in the keyword-candidate list. If the keyword ki is not contained in the preference keyword set, then the weight of ki in the preference weight vector is 0, i.e., wi=0. The preference weight vectors of the active user and a previous user are noted as → and → , respectively. WAP WPP In this paper, we use the Analytic Hierarchy Process (AHP) model to decide the weight of the keywords in the preference keyword set of the active user. AHP method is provided by Saaty in 1970s to choose the best satisfied business role for its hierarchy nature [26]. The weight computing based on the AHP model is decied as follows: Firstly, we construct the pair-wise comparison matrix in terms of the relative importance between each two key-words. The pair-wise comparison matrix Am = (aij) m must satisfy the following properties, aij represents the relative importance of two keywords:
  • 6. ISSN 2350-1022 International Journal of Recent Research in Mathematics Computer Science and Information Technology Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org Page | 84 Paper Publications 1) aij =1, i=j=1,2,3…………m 2) aij=1/ aij, i=j=1,2,3…………m and i≠j 3) aij= aik/ ajk , i=j=k=1,2,3…….m and i≠j After checking the consistence of the matrix, then we calculate the weight by the following function: Wi ∑ ∑ (2) Where aij is the relative importance between two keywords, m is the number of the keywords in the preference keyword set of the active user The weight vector of the preference keyword set of a previous user can be decided by the term frequency/inverse document frequency (TF-IDF) measure [27], which is one of the best-known measures for specifying the weight of key- words in Information Retrieval. In the TF-IDF approach, to calculate the preference weight vector of a previous user u', all reviews by user u' should be collected. Here, all reviews contain the reviews by user u' for the candidate services and similar ser-vices not in the candidate services. The reviews should also be transformed into keyword sets respectively according to the keyword- candidate list and the domain thesaurus. TF, the term frequency of the keyword pki in the preference keyword set of user u' is defined as TF =Npki / ∑ 3) Where is the number of occurrences of the keyword pki in all the keyword sets of the reviews commented by the same user u', g is the number of the keywords in the preference keyword set of the user u'. The inverse document frequency (IDF) is obtained by dividing the number of all reviews by the number of reviews containing the keyword pki. IDF = log |R' | / |r': pki∈ r'| (4) Where |R'| is the total number of the reviews commented by user u’, and |r': pki∈ r'| is the number of reviews where keyword pki appears. So the TF-IDF weight of the keyword pki in the preference keyword set of user u' can be decided by the following function: wpki=TF×IDF= (Npki / ∑ ) log |R' | / |r': pki∈ r'|) (5) Then the similarity based on the cosine-based approach is defined as follows: → → → → wAP wPP Sim(APK, PPK)=cos(wAP, wPP) = → → wAP||2 ||wPP||2 → → ∑ = → → (6) ∑ ∑
  • 7. ISSN 2350-1022 International Journal of Recent Research in Mathematics Computer Science and Information Technology Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org Page | 85 Paper Publications → → Where wAP and wPP are respectively the preference weight vectors of the active user and a previous user. Vector wAP,i is the i-th dimension of vector wAP and represents the weight of the keyword ki in preference keyword set APK,vector wpp,i is the i-th dimension of vector wpp and represents the weight of the keyword ki in preference keyword set PPK. Algorithm 2, SIM-ESC, illustrates the functionality of the exact similarity computation method. Algorithm 2: SIM-ESC (Exact Similarity Computation ) Input: The preference keyword set of the active user APK The preference keyword set of a previous user PPKj. Output: The similarity of APK and PPKj, simESC(APK,PPKj). 1: for each keyword ki in the keyword-candidate list 2: if ki APK then 3: get → by formula(2) WAP,i 4: else → WAP,i = 0 5: end if 6: if ki PPKj then 7: get → by formula (5) Wpp,i 8: else → WPP,i = 0 9: end if 10: end for 11: get simESC(APK,PPKj).by formula(6) 12: return the similarity of APK and PPKj simESC(APK,PPKj). (3) Calculate personalized ratings and generate recom-mendations: Based on the similarity of the active user and previous us-ers, further filtering will be conducted. Given a threshold δ, if sim(APK,PPKj)< δ, the preference keyword set of a pre-vious user PPKj will be filtered out, otherwise PPKj will be retained. Finally, a personalized service recommendation list will be presented to the user and the ser-vice(s) with the highest rating(s) will be recommended to him/her. We use a weighted average approach to calculate the personalized rating pr of a service for the active user. pr = r + k ∑ k = 1/∑
  • 8. ISSN 2350-1022 International Journal of Recent Research in Mathematics Computer Science and Information Technology Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org Page | 86 Paper Publications where sim(APK,PPKj) is the similarity of the preference keyword set of the active user and the preference keyword set of a previous user PPKj; multiplier k serves as a normalizing factor, ˆR( R cap) denotes the set of the remaining preference keyword sets of previous users after filtering; rj is the rating of the corresponding review of PPKj, and r bar is defined as the average ratings of the candidate service. we can calculate the personalized ratings of all candidate services for the active user. Then we can rank the services by the personalized ratings and present a personalized service recommendation list to him/her. we assume that the services with higher ratings are more preferable to the user. So the service(s) with the highest rating(s) will be recommended to the active user. we can recommend the Top-K services to the user. Algorithm 3: Basic Algorithm of KASR Input: The preference keyword set of the active user APK The candidate services WS={ws1,ws2,……wsN} The threshold δ in the filtering phase The number K . Output: The services with the Top-K highest ratings {tws1, tws2, …, twsK}. 1: for each service wsi WS 2: ˆR = Ф sum = 0, r = 0 3: for each review Rj of service wsi 4: process the review into a preference keyword set PPKj 5: if PPKj ᴖ APK = Ф then 6: insert PPKj into ˆR 7: end if 8: end for 9: for each keyword set PPK j ˆR 10: 11: then 12: remove PPKj from ˆR 13: else sum=sum+1, r=r+rj 14: end if 15: end for 16: r = r/sum 17: get pri by formula (7) 18: end for 19: sort the services according to the personalized ratings pri 20: return the services with the Top-K highest ratings {tws1, tws2… twsk} Algorithm 3 illustrates the basic algorithm of KASR. The input contains the preference keyword set of the active user APK, the candidate services WS={ws1,ws2 ,…wsN} the threshold δ in the filtering phase, and the number K. In line 2, ˆR is used to store the remaining preference keyword sets of previous users, and sum is to record the number of the remaining
  • 9. ISSN 2350-1022 International Journal of Recent Research in Mathematics Computer Science and Information Technology Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org Page | 87 Paper Publications preference keyword sets of previous users. Line 3 to line 8 is used to process each review of the previous users into the corresponding preference keyword sets, and then do a simple filtering to filter out the reviews unrelated with the active user's preferences. Line 9 to line 15 are to calculate the similarity of APK and PPKj, and then filter out the keyword set PPKj whose similarity with APK is less than the threshold δ. In this paper, there are two methods to calculate the similarity: approximate similarity computation (see Algorithm 1) and exact similarity computation (see Algorithm 2). Line 16 to line 18 is to calculate the personalized ratings of the candidate services for the active user. Finally, line 19 and line 20 is to sort the candidate services according to the personalized ratings and recommend the services with the Top-K highest ratings to the active user. V. RESULTS AND DISCUSSION To verify the scalability of KASR, experiment is conducted respectively in a cluster of nodes ranging from 1 to 8. There are 4 synthetic datasets used in the experiments (128M, 256M, 512M and 1G datasize). Fig. 2 shows the speedup of KASR (Here, KASR-ESC method is adopted in the scalability experiment). From Fig. 2, we can see that the speedup of KASR increases relative linearly with the growth of the number of nodes. Meanwhile, larger dataset obtained a better speedup. When the datasize is 1G and the number of nodes is 8, the speedup value reaches 6.412, which is 80.15% (6.412/8=80.15%) of the idealspeedup. The experimental result shows that KASR on Map-Reduce in Hadoop platform has good scalability over Big Data and performs better with larger dataset. Fig. 2 Speedup of KASR Overall, these experimental results show that KASR per-forms well in accuracy and KASR on Mapreduce framework has good scalability in Big Data environment. VI. CONCLUSION In this paper presenting a personalized service recommendation list and recommending the most appropriate service(s) to the users. Moreover, to improve the scalability and efficiency of KASR in Big Data environment, we have implemented it on a MapReduce framework in Hadoop platform. Finally, the experimental results demonstrate that KASR significantly improves the accuracy and scalability of service recommender systems over existing approaches.
  • 10. ISSN 2350-1022 International Journal of Recent Research in Mathematics Computer Science and Information Technology Vol. 1, Issue 2, pp: (79-88), Month: October 2014 – March 2015, Available at: www.paperpublications.org Page | 88 Paper Publications VII. FUTURE SCOPE Implement Collaborative filtering algorithm on mapreduce is adopted to generate appropriate recommendations. To improve the scalability and efficiency of Keyword Service Recommendation Method in Bigdata Environment,to distinguish the positive and negative preferences of the users from their reviews to make the predictions more accurate. REFERENCES [1] Key Word Aware Service Recommendaion Method. [2] C. Lynch, “Big Data: How do your data grow?” Nature, Vol. 455, No. 7209, pp. 28-29, 2008.. [3] F. Chang, J. Dean, S. Ghemawat, and W. C. Hsieh, “Bigtable: A distributed storage system for structured data,” ACM Transactions on Computer Systems,Vol. 26, No. 2 (4) , 2008. [4] W. Dou, X. Zhang, J. Liu, J. Chen, “HireSome-II: Towards Privacy-Aware Cross-Cloud Service Composition for Big Data Applica-tions,” IEEE Transactions on Parallel and Distributed Systems, 2013. [5] G. Linden, B. Smith, and J. York, “Amazon.com Recommendations: Item-to-Item Collaborative Filtering,” IEEE Internet Computing, Vol. 7, No.1, pp. 76-80, 2003. [6] M. Bjelica, “Towards TV Recommender System Experiments with User Modeling,” IEEE Transactions on Consumer Electronics, Vol. 56, No.3, pp. 1763-1769, 2010. [7] M. Alduan, F. Alvarez, J. Menendez, and O. Baez, “Recommender System for Sport Videos Based on User Audiovisual Consumption,” IEEE Transactions on Multimedia, Vol. 14, No.6, pp. 1546-1557, 2013. [8] Y. Chen, A. Cheng and W. Hsu, “Travel Recommendation by Mining People Attributes and Travel Group Types From Community-Contributed Photos”. IEEE Transactions on Multimedia, Vol. 25, No.6, pp. 1283-1295, 2012.www.ijecse.org. Author’s Profile: Dayanand Bhovi completed the bachelors degree in Informaton Science &Engineering from University of VTU and presently pursuing Masters in Engineering in Computer Science & Engineering at Mangalore Institute of Technology, Mangalore. Ashwin kumar completed bachelors and masters degree in Computer Science and Engineering. Currently working as Sr. Assistant professor in Mangalore Institute Technology, Mangalore.