SlideShare a Scribd company logo
1 of 5
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1683
Information Retrieval and De-duplication for Tourism Recommender
System
Rajesh Thasal1, Shubhada Yelkar2, Amit Tare3, Sharmila Gaikwad4
1, 2, 3 Student VIII Sem, BE, Computer Engineering, RGIT, Mumbai, India
4 Professor, Computer Engineering, RGIT, Mumbai, India.
---------------------------------------------------------------------***-------------------------------------------------------------------
Abstract - Nowadaysmany people rely on online services
to plan a trip. However, they are usually faced with the
problem of being supply with lots of information. In
consequence, they have to invest great deal of time todecide
what to visit on the basis of their preferences. This paper
presents a working of Tourism Recommendation System
which helps tourist to find best location suitable for their
tourist profile. Recommender system is implemented using
hadoop technology. For providing best object with great
accuracy system goes through four important phases i.e.
scrapping, mapping, de-duplication and recommendation.
First the data object is scrapped from differentwebsiteusing
jsoup library, after that datasets are mapped using Pig Latin
script of hadoop. Mapping maps only required attributes
from datasets. In data de-duplication process duplicate
copies of same object will be resolved and only unique
objects will be saved in datasets. In last phase
(Recommendation) the best suitable location is provided
with the help of tourist profile which is created by tourist.
This tourist profile consists of name, duration of visit,
location rating, point of interest, place rating etc.
Key Word: Hadoop, Jsoup, Pig-Latin, De-duplication,
Recommendation.
1. INTRODUCTION
Information Retrieval and De-duplication for Tourism
Recommender System is the system which recommendsthe
tourist interesting objects on the basis of tourist profile.
Tourist profile consists of duration of visit, point of interest,
property type and their preferences about different types of
objects and events. On the basis of tourist profile,
recommendation system is responsible for searching
suitable information from database and returns the best
objects for the end user. The information present in the
database is extracted through scrapper. System uses the
Jsoup java library package for extracting necessary
information from the web pages. While extracting data
sometimes system gets duplicate informationwhichreduces
the quality of information. To overcome thisproblemsystem
uses De-duplication technique. De-duplication is a
specialized data compression technique for removing
duplicate copies of repeating data. De-duplication improves
system recommendation and object information quality of
system.
2. OBJECTIVES
The main purpose of Tourism Recommender System is to
create a personalization tool that recommends tourist a list
of information items that best the individual tastes. A
recommender system infers the user preferences by
analyzing the available user data, information about hotels
and information about the point of interest. The accuracy of
the recommendations highly depends on the amount of
available information. System gathers information from
different web pages and represents only uniqueinformation
by applying filtering and De-duplication process. De-
duplication of data is the important motive of this system.
De-duplication improves system recommendations and
object information quality.
3. PROPOSED SYSTEM
Proposed system is designed to overcome the problem of
data duplication. Proposed system starts with extractingthe
data from different websites. While extracting data
sometime system get duplicate data which reduces the
quality of our system. So to overcome this problem system
used de-duplication technique. Before de-duplication
mapping is done on datasets and after that filtration process
is applied. Proposed system givesrecommendationbasedon
user’s choice of preferences. Userspreferences are compare
with the de-duplicate datasets, if correct matchisfoundthen
recommendation of location with additional information,
hotel description and percentage of accuracyareprovidedto
the users.
The proposed system goes through the four phases:
1. Scrapping
2. Mapping
3. De-duplication
4. Recommendation
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 04 | Apr-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1684
Fig. 3. System Architecture
4. FUNCTIONS OF PHASES:
4.1 Scrapping
Scrapping is a technique employed to extract large
amounts of data from websites wherebythedataisextracted
and saved to a database. Data is scrapped using jsoupparser.
Jsoup is a Java based library to work with HTML based
content. Jsoup used to parse HTML document. Jsoup
provides API to extract and manipulate data from URL or
HTML file.
The data is scrapped from the two websites thatare
MTDC and MAKEMYTRIP. The fig 4.1 and 4.1.2 are the
sample datasets of MTDC and MAKEMYTRIP websites.
Fig. 4.1.1 sample dataset of MTDC
Fig.4.1.2 Sample Dataset of MAKEMYTRIP
4.2 Mapping
The data objects which are scrapped from the
different websites need to be mapped according to the
attributes. This is done by mapping the attributes of the
objects with the help of Pig Latin in Hadoop. Inmappingonly
required fields are taken for further processing such as
filtering. Pig uses simple sql-like scripting language called
Pig Latin. Pig Latin script is worked as follows:
LOAD: A load statement reads data from the file system.
TRANSFORM: A series of "transformation" statements
(FILTER, FOREACH, GENERATE) to process the data.
STORE: A store statement writes output to the file system.
DUMP: a dump statement displays output to the screen
Fig . 4.2.1 Mapping
Fig 4.2.2 Mapped and Merged dataset
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 04 | Apr-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1685
4.3 De-duplication
De-duplication is a specialized data compression technique
for removing duplicate copies of repeating data. In this
technique system search for duplicate copies of data if
duplicate is found during analysis then only single copy of
that attribute is stored in dataset and otherduplicatescopies
are removed using Pig Latin script.
This technique is used to improve storage utilization and
speed of execution.
4.4 Recommendation
In recommendation first tourist will fill up the touristprofile
form which includes point of interest, property type,
duration of visit, location rating and place rating. After that
this choice of preferences are compared with de-duplicated
dataset. If perfect or nearby match is found then that
location will be given as recommendationwithpercentageof
accuracy.
Fig. 4.4.1 Tourist Profile
Fig. 4.4.2 Recommendation
5. TECHNOLOGY USED FOR IMPLEMENTATION
5.1 HADOOP
Hadoop is an open-source framework that allows
users to store and process large amount of Data in a
distributed environment across clusters of computer using
programming models. It is designed to scale up from single
servers to several of machines with high degree of fault
tolerance. Data in a Hadoop cluster is broken down into
smaller chunks and distributed throughout the cluster like
the Map and Reduce functions that are executed on smaller
chunks of larger data sets, and this provides the scalability
required for Big Data processing.
5.2 APACHE PIG
Apache pig is a project which is lies on top of
Hadoop, and it provides scripting language to use Hadoop’s
Map-Reduce functionality, Pig Latin scriptisusedtoperform
read, filter, transform, join, dump and write data with less
programming skills. Pig allows data workers to write
complex data transformations withouttheknowledgeofjava
programming language.
Pig is made up of two main components
a) Pig Latin
b) Runtime environment
A. PIG LATIN
Pig uses simple sql-likescripting languagecalledPig
Latin. Pig Latin is relatively simple language that executes a
set of commands. Pig latin statements works with relations
(bag with collection of tuple), a Pig relation is similar to a
table in a relational database, where the tuples in the bag
correspond to the records in a table. Pig Latinstatementscan
be in multiple lines and it must end with a semi-colon.
Pig Latin helps non-java developers as it takes less time to
code, for example in a test 5 lines of Pig Latin ≈ 100 lines of
java. This takes3 hoursto write in java but 10 minutesinPig
Latin.
Table 1. Pig Commands
FUNCTION PIG COMMANDS
Command to load data
LOAD()
PigStorage()
BinStorage()
TextLoader()
Command to work with
data
Filter
Foreach
Join
Group
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 04 | Apr-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1686
Cogroup
Union
Split
Command to debug the data Describe
Explain
Illustrate
Command to retrieve results Dump
Store
B. RUN TIME ENVIRONMENT
The pig execution environment has two modes of
execution they are,
1. Local mode: Pig scripts runs on a single machine were
Hadoop Map-Reduce and HDFS are not required.
2. Hadoop or Map-Reduce mode: pig scripts runs on a
Hadoop. Pig statements can be executed in three different
ways, in which all the three are executed in both local and
Hadoop modes.
1. Grunt shell: Allowsusersto type pig commandsmanually
using pig’s interactive shell, grunt.
2. Script file: Place pig commandsin a script file andrunthe
script file.
3. Embedded: Embed the pig statements in a host language
and run the statement.
5.3 JSOUP
Jsoup is a java html parser. It is a java library that is used to
parse HTML document. Jsoup provides API to scrap and
change data from URL or HTML file. It uses DOM, CSS and
Jquery-like methods for scrapping and manipulating file.
J
soup: parsing string
Syntax:
Document d = Jsoup.parse(html);
Where
 Document - document object represents theHTML
DOM.
 Jsoup - main class to parse the given HTML String.
 Html - HTML String.
Parsing data from Holidiffy.com
6. FLOWCHART
Fig. 6 Flowchart
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 05 Issue: 04 | Apr-2018 www.irjet.net p-ISSN: 2395-0072
© 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1687
7. CONCLUSION
This paper gives an overview of implementation of Tourism
Recommender System which gives the personalize
suggestion to the tourist for selecting best location which is
suitable for their profile. This system is implemented using
hadoop technology so that processing on huge amount of
dataset becomes easy. For implementing mapping and de-
duplication Pig Latin script is used. Pig Latin is an extension
of map-reduce. Map-reduce functionality of hadoop is
implemented using pig. With the help of pig programmers
does not have to write lines of code, simply they type
command in grunt shell toreduce programming complexity.
With the help of pig, recommender systemissimpleandeasy
to understand. It also improves the execution speed of
recommender system.
8. REFERENCES
[1] Bhosale, Harshawardhan S., and Devendra P. Gadekar."A
Review Paper on Big Data and Hadoop." International
Journal of Scientific and Research Publications 4.10 (2014)
[2] Chavan, Ms Vibhavari, and Rajesh N. Phursule. "Survey
paper on big data." Int. J. Comput. Sci. Inf. Technol 5.6
(2014): 7932-7939.
[3] O. Benjelloun, H. Garcia-Molina, D. Menestrina, Q. Su, S.E.
Whang, and J. Widom. Swoosh: a generic approach to entity
resolution. The VLDB Journal, March 2008. VLDB Journal
(Online First) link:
http://dx.doi.org/10.1007/s00778-008-0098-x.
[4] M. Bilenko. Learning to combine trained distancemetrics
for duplicate detection in databases. Submitted to CIKM-
2002, (February):1–19, 2002.
[5] Adomavicius, G & Tuzhilin, A (2005), Toward the next
generation of recommender systems: A survey of the state-
of-the-art and possible extensions, IEEE Transactions on
Knowledge and Data Engineering, vol. 17, no. 6, pp.734-749.
[6] J. L. Herlocker, J. A. Konstan, L. G. Terveen, & J. T. Riedl,
Evaluating collaborative filtering recommender systems,
ACM Transactions on Information Systems, vol. 22, no. 1,pp.
5-53, 2004.

More Related Content

What's hot

Development of pattern knowledge discovery framework using
Development of pattern knowledge discovery framework usingDevelopment of pattern knowledge discovery framework using
Development of pattern knowledge discovery framework usingIAEME Publication
 
Frequent Item Set Mining - A Review
Frequent Item Set Mining - A ReviewFrequent Item Set Mining - A Review
Frequent Item Set Mining - A Reviewijsrd.com
 
Materials Informatics and Python
Materials Informatics and PythonMaterials Informatics and Python
Materials Informatics and PythonShintaro Fukushima
 
Data characterization towards modeling frequent pattern mining algorithms
Data characterization towards modeling frequent pattern mining algorithmsData characterization towards modeling frequent pattern mining algorithms
Data characterization towards modeling frequent pattern mining algorithmscsandit
 
SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITY
SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITYSOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITY
SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITYIJDKP
 
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...IJDKP
 
Effieient Algorithms to Find Frequent Itemset using Data Mining
Effieient Algorithms to Find Frequent Itemset using Data MiningEffieient Algorithms to Find Frequent Itemset using Data Mining
Effieient Algorithms to Find Frequent Itemset using Data MiningIRJET Journal
 
Review: Data Driven Traffic Flow Forecasting using MapReduce in Distributed M...
Review: Data Driven Traffic Flow Forecasting using MapReduce in Distributed M...Review: Data Driven Traffic Flow Forecasting using MapReduce in Distributed M...
Review: Data Driven Traffic Flow Forecasting using MapReduce in Distributed M...AM Publications
 
The Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataThe Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataOlaf Hartig
 
Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Valery Tkachenko
 
PAS: A Sampling Based Similarity Identification Algorithm for compression of ...
PAS: A Sampling Based Similarity Identification Algorithm for compression of ...PAS: A Sampling Based Similarity Identification Algorithm for compression of ...
PAS: A Sampling Based Similarity Identification Algorithm for compression of ...rahulmonikasharma
 
Modern association rule mining methods
Modern association rule mining methodsModern association rule mining methods
Modern association rule mining methodsijcsity
 

What's hot (14)

Development of pattern knowledge discovery framework using
Development of pattern knowledge discovery framework usingDevelopment of pattern knowledge discovery framework using
Development of pattern knowledge discovery framework using
 
EXECUTION OF ASSOCIATION RULE MINING WITH DATA GRIDS IN WEKA 3.8
EXECUTION OF ASSOCIATION RULE MINING WITH DATA GRIDS IN WEKA 3.8EXECUTION OF ASSOCIATION RULE MINING WITH DATA GRIDS IN WEKA 3.8
EXECUTION OF ASSOCIATION RULE MINING WITH DATA GRIDS IN WEKA 3.8
 
Frequent Item Set Mining - A Review
Frequent Item Set Mining - A ReviewFrequent Item Set Mining - A Review
Frequent Item Set Mining - A Review
 
Materials Informatics and Python
Materials Informatics and PythonMaterials Informatics and Python
Materials Informatics and Python
 
Data characterization towards modeling frequent pattern mining algorithms
Data characterization towards modeling frequent pattern mining algorithmsData characterization towards modeling frequent pattern mining algorithms
Data characterization towards modeling frequent pattern mining algorithms
 
SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITY
SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITYSOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITY
SOURCE CODE RETRIEVAL USING SEQUENCE BASED SIMILARITY
 
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
USING ONTOLOGIES TO IMPROVE DOCUMENT CLASSIFICATION WITH TRANSDUCTIVE SUPPORT...
 
Effieient Algorithms to Find Frequent Itemset using Data Mining
Effieient Algorithms to Find Frequent Itemset using Data MiningEffieient Algorithms to Find Frequent Itemset using Data Mining
Effieient Algorithms to Find Frequent Itemset using Data Mining
 
Review: Data Driven Traffic Flow Forecasting using MapReduce in Distributed M...
Review: Data Driven Traffic Flow Forecasting using MapReduce in Distributed M...Review: Data Driven Traffic Flow Forecasting using MapReduce in Distributed M...
Review: Data Driven Traffic Flow Forecasting using MapReduce in Distributed M...
 
The Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked DataThe Impact of Data Caching of on Query Execution for Linked Data
The Impact of Data Caching of on Query Execution for Linked Data
 
Data mining weka
Data mining wekaData mining weka
Data mining weka
 
Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...Using publicly available resources to build a comprehensive knowledgebase of ...
Using publicly available resources to build a comprehensive knowledgebase of ...
 
PAS: A Sampling Based Similarity Identification Algorithm for compression of ...
PAS: A Sampling Based Similarity Identification Algorithm for compression of ...PAS: A Sampling Based Similarity Identification Algorithm for compression of ...
PAS: A Sampling Based Similarity Identification Algorithm for compression of ...
 
Modern association rule mining methods
Modern association rule mining methodsModern association rule mining methods
Modern association rule mining methods
 

Similar to Tourism Recommender System using Hadoop and De-duplication

IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...
IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...
IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...IRJET Journal
 
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS ijcax
 
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS ijcax
 
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS ijcax
 
Irjet v4 iA Survey on FP (Growth) Tree using Association Rule Mining7351
Irjet v4 iA Survey on FP (Growth) Tree using Association Rule Mining7351Irjet v4 iA Survey on FP (Growth) Tree using Association Rule Mining7351
Irjet v4 iA Survey on FP (Growth) Tree using Association Rule Mining7351IRJET Journal
 
Search Engine Scrapper
Search Engine ScrapperSearch Engine Scrapper
Search Engine ScrapperIRJET Journal
 
IRJET - Automated Fraud Detection Framework in Examination Halls
 IRJET - Automated Fraud Detection Framework in Examination Halls IRJET - Automated Fraud Detection Framework in Examination Halls
IRJET - Automated Fraud Detection Framework in Examination HallsIRJET Journal
 
Association Rule Mining using RHadoop
Association Rule Mining using RHadoopAssociation Rule Mining using RHadoop
Association Rule Mining using RHadoopIRJET Journal
 
A Novel Framework on Web Usage Mining
A Novel Framework on Web Usage MiningA Novel Framework on Web Usage Mining
A Novel Framework on Web Usage MiningIRJET Journal
 
IRJET- Secure Distributed Data Mining
IRJET- Secure Distributed Data MiningIRJET- Secure Distributed Data Mining
IRJET- Secure Distributed Data MiningIRJET Journal
 
IRJET- Analysis of Boston’s Crime Data using Apache Pig
IRJET- Analysis of Boston’s Crime Data using Apache PigIRJET- Analysis of Boston’s Crime Data using Apache Pig
IRJET- Analysis of Boston’s Crime Data using Apache PigIRJET Journal
 
Fast Range Aggregate Queries for Big Data Analysis
Fast Range Aggregate Queries for Big Data AnalysisFast Range Aggregate Queries for Big Data Analysis
Fast Range Aggregate Queries for Big Data AnalysisIRJET Journal
 
A Data Warehouse Design and Usage.pdf
A Data Warehouse Design and Usage.pdfA Data Warehouse Design and Usage.pdf
A Data Warehouse Design and Usage.pdfCassie Romero
 
IRJET- A Comparative Research of Rule based Classification on Dataset using W...
IRJET- A Comparative Research of Rule based Classification on Dataset using W...IRJET- A Comparative Research of Rule based Classification on Dataset using W...
IRJET- A Comparative Research of Rule based Classification on Dataset using W...IRJET Journal
 
Peer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
Peer-to-Peer Data Sharing and Deduplication using Genetic AlgorithmPeer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
Peer-to-Peer Data Sharing and Deduplication using Genetic AlgorithmIRJET Journal
 
Study on Relavance Feature Selection Methods
Study on Relavance Feature Selection MethodsStudy on Relavance Feature Selection Methods
Study on Relavance Feature Selection MethodsIRJET Journal
 
Comparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & PythonComparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & PythonIRJET Journal
 
Machine Learning-Based Phishing Detection
Machine Learning-Based Phishing DetectionMachine Learning-Based Phishing Detection
Machine Learning-Based Phishing DetectionIRJET Journal
 
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...IRJET Journal
 
IRJET- Placement Portal
IRJET- Placement PortalIRJET- Placement Portal
IRJET- Placement PortalIRJET Journal
 

Similar to Tourism Recommender System using Hadoop and De-duplication (20)

IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...
IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...
IRJET- Classification of Pattern Storage System and Analysis of Online Shoppi...
 
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
 
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
 
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
IDENTIFY NAVIGATIONAL PATTERNS OF WEB USERS
 
Irjet v4 iA Survey on FP (Growth) Tree using Association Rule Mining7351
Irjet v4 iA Survey on FP (Growth) Tree using Association Rule Mining7351Irjet v4 iA Survey on FP (Growth) Tree using Association Rule Mining7351
Irjet v4 iA Survey on FP (Growth) Tree using Association Rule Mining7351
 
Search Engine Scrapper
Search Engine ScrapperSearch Engine Scrapper
Search Engine Scrapper
 
IRJET - Automated Fraud Detection Framework in Examination Halls
 IRJET - Automated Fraud Detection Framework in Examination Halls IRJET - Automated Fraud Detection Framework in Examination Halls
IRJET - Automated Fraud Detection Framework in Examination Halls
 
Association Rule Mining using RHadoop
Association Rule Mining using RHadoopAssociation Rule Mining using RHadoop
Association Rule Mining using RHadoop
 
A Novel Framework on Web Usage Mining
A Novel Framework on Web Usage MiningA Novel Framework on Web Usage Mining
A Novel Framework on Web Usage Mining
 
IRJET- Secure Distributed Data Mining
IRJET- Secure Distributed Data MiningIRJET- Secure Distributed Data Mining
IRJET- Secure Distributed Data Mining
 
IRJET- Analysis of Boston’s Crime Data using Apache Pig
IRJET- Analysis of Boston’s Crime Data using Apache PigIRJET- Analysis of Boston’s Crime Data using Apache Pig
IRJET- Analysis of Boston’s Crime Data using Apache Pig
 
Fast Range Aggregate Queries for Big Data Analysis
Fast Range Aggregate Queries for Big Data AnalysisFast Range Aggregate Queries for Big Data Analysis
Fast Range Aggregate Queries for Big Data Analysis
 
A Data Warehouse Design and Usage.pdf
A Data Warehouse Design and Usage.pdfA Data Warehouse Design and Usage.pdf
A Data Warehouse Design and Usage.pdf
 
IRJET- A Comparative Research of Rule based Classification on Dataset using W...
IRJET- A Comparative Research of Rule based Classification on Dataset using W...IRJET- A Comparative Research of Rule based Classification on Dataset using W...
IRJET- A Comparative Research of Rule based Classification on Dataset using W...
 
Peer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
Peer-to-Peer Data Sharing and Deduplication using Genetic AlgorithmPeer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
Peer-to-Peer Data Sharing and Deduplication using Genetic Algorithm
 
Study on Relavance Feature Selection Methods
Study on Relavance Feature Selection MethodsStudy on Relavance Feature Selection Methods
Study on Relavance Feature Selection Methods
 
Comparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & PythonComparing the performance of a business process: using Excel & Python
Comparing the performance of a business process: using Excel & Python
 
Machine Learning-Based Phishing Detection
Machine Learning-Based Phishing DetectionMachine Learning-Based Phishing Detection
Machine Learning-Based Phishing Detection
 
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
 
IRJET- Placement Portal
IRJET- Placement PortalIRJET- Placement Portal
IRJET- Placement Portal
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learningmisbanausheenparvam
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 

Recently uploaded (20)

HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
chaitra-1.pptx fake news detection using machine learning
chaitra-1.pptx  fake news detection using machine learningchaitra-1.pptx  fake news detection using machine learning
chaitra-1.pptx fake news detection using machine learning
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 

Tourism Recommender System using Hadoop and De-duplication

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 03 | Mar-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1683 Information Retrieval and De-duplication for Tourism Recommender System Rajesh Thasal1, Shubhada Yelkar2, Amit Tare3, Sharmila Gaikwad4 1, 2, 3 Student VIII Sem, BE, Computer Engineering, RGIT, Mumbai, India 4 Professor, Computer Engineering, RGIT, Mumbai, India. ---------------------------------------------------------------------***------------------------------------------------------------------- Abstract - Nowadaysmany people rely on online services to plan a trip. However, they are usually faced with the problem of being supply with lots of information. In consequence, they have to invest great deal of time todecide what to visit on the basis of their preferences. This paper presents a working of Tourism Recommendation System which helps tourist to find best location suitable for their tourist profile. Recommender system is implemented using hadoop technology. For providing best object with great accuracy system goes through four important phases i.e. scrapping, mapping, de-duplication and recommendation. First the data object is scrapped from differentwebsiteusing jsoup library, after that datasets are mapped using Pig Latin script of hadoop. Mapping maps only required attributes from datasets. In data de-duplication process duplicate copies of same object will be resolved and only unique objects will be saved in datasets. In last phase (Recommendation) the best suitable location is provided with the help of tourist profile which is created by tourist. This tourist profile consists of name, duration of visit, location rating, point of interest, place rating etc. Key Word: Hadoop, Jsoup, Pig-Latin, De-duplication, Recommendation. 1. INTRODUCTION Information Retrieval and De-duplication for Tourism Recommender System is the system which recommendsthe tourist interesting objects on the basis of tourist profile. Tourist profile consists of duration of visit, point of interest, property type and their preferences about different types of objects and events. On the basis of tourist profile, recommendation system is responsible for searching suitable information from database and returns the best objects for the end user. The information present in the database is extracted through scrapper. System uses the Jsoup java library package for extracting necessary information from the web pages. While extracting data sometimes system gets duplicate informationwhichreduces the quality of information. To overcome thisproblemsystem uses De-duplication technique. De-duplication is a specialized data compression technique for removing duplicate copies of repeating data. De-duplication improves system recommendation and object information quality of system. 2. OBJECTIVES The main purpose of Tourism Recommender System is to create a personalization tool that recommends tourist a list of information items that best the individual tastes. A recommender system infers the user preferences by analyzing the available user data, information about hotels and information about the point of interest. The accuracy of the recommendations highly depends on the amount of available information. System gathers information from different web pages and represents only uniqueinformation by applying filtering and De-duplication process. De- duplication of data is the important motive of this system. De-duplication improves system recommendations and object information quality. 3. PROPOSED SYSTEM Proposed system is designed to overcome the problem of data duplication. Proposed system starts with extractingthe data from different websites. While extracting data sometime system get duplicate data which reduces the quality of our system. So to overcome this problem system used de-duplication technique. Before de-duplication mapping is done on datasets and after that filtration process is applied. Proposed system givesrecommendationbasedon user’s choice of preferences. Userspreferences are compare with the de-duplicate datasets, if correct matchisfoundthen recommendation of location with additional information, hotel description and percentage of accuracyareprovidedto the users. The proposed system goes through the four phases: 1. Scrapping 2. Mapping 3. De-duplication 4. Recommendation
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 04 | Apr-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1684 Fig. 3. System Architecture 4. FUNCTIONS OF PHASES: 4.1 Scrapping Scrapping is a technique employed to extract large amounts of data from websites wherebythedataisextracted and saved to a database. Data is scrapped using jsoupparser. Jsoup is a Java based library to work with HTML based content. Jsoup used to parse HTML document. Jsoup provides API to extract and manipulate data from URL or HTML file. The data is scrapped from the two websites thatare MTDC and MAKEMYTRIP. The fig 4.1 and 4.1.2 are the sample datasets of MTDC and MAKEMYTRIP websites. Fig. 4.1.1 sample dataset of MTDC Fig.4.1.2 Sample Dataset of MAKEMYTRIP 4.2 Mapping The data objects which are scrapped from the different websites need to be mapped according to the attributes. This is done by mapping the attributes of the objects with the help of Pig Latin in Hadoop. Inmappingonly required fields are taken for further processing such as filtering. Pig uses simple sql-like scripting language called Pig Latin. Pig Latin script is worked as follows: LOAD: A load statement reads data from the file system. TRANSFORM: A series of "transformation" statements (FILTER, FOREACH, GENERATE) to process the data. STORE: A store statement writes output to the file system. DUMP: a dump statement displays output to the screen Fig . 4.2.1 Mapping Fig 4.2.2 Mapped and Merged dataset
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 04 | Apr-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1685 4.3 De-duplication De-duplication is a specialized data compression technique for removing duplicate copies of repeating data. In this technique system search for duplicate copies of data if duplicate is found during analysis then only single copy of that attribute is stored in dataset and otherduplicatescopies are removed using Pig Latin script. This technique is used to improve storage utilization and speed of execution. 4.4 Recommendation In recommendation first tourist will fill up the touristprofile form which includes point of interest, property type, duration of visit, location rating and place rating. After that this choice of preferences are compared with de-duplicated dataset. If perfect or nearby match is found then that location will be given as recommendationwithpercentageof accuracy. Fig. 4.4.1 Tourist Profile Fig. 4.4.2 Recommendation 5. TECHNOLOGY USED FOR IMPLEMENTATION 5.1 HADOOP Hadoop is an open-source framework that allows users to store and process large amount of Data in a distributed environment across clusters of computer using programming models. It is designed to scale up from single servers to several of machines with high degree of fault tolerance. Data in a Hadoop cluster is broken down into smaller chunks and distributed throughout the cluster like the Map and Reduce functions that are executed on smaller chunks of larger data sets, and this provides the scalability required for Big Data processing. 5.2 APACHE PIG Apache pig is a project which is lies on top of Hadoop, and it provides scripting language to use Hadoop’s Map-Reduce functionality, Pig Latin scriptisusedtoperform read, filter, transform, join, dump and write data with less programming skills. Pig allows data workers to write complex data transformations withouttheknowledgeofjava programming language. Pig is made up of two main components a) Pig Latin b) Runtime environment A. PIG LATIN Pig uses simple sql-likescripting languagecalledPig Latin. Pig Latin is relatively simple language that executes a set of commands. Pig latin statements works with relations (bag with collection of tuple), a Pig relation is similar to a table in a relational database, where the tuples in the bag correspond to the records in a table. Pig Latinstatementscan be in multiple lines and it must end with a semi-colon. Pig Latin helps non-java developers as it takes less time to code, for example in a test 5 lines of Pig Latin ≈ 100 lines of java. This takes3 hoursto write in java but 10 minutesinPig Latin. Table 1. Pig Commands FUNCTION PIG COMMANDS Command to load data LOAD() PigStorage() BinStorage() TextLoader() Command to work with data Filter Foreach Join Group
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 04 | Apr-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1686 Cogroup Union Split Command to debug the data Describe Explain Illustrate Command to retrieve results Dump Store B. RUN TIME ENVIRONMENT The pig execution environment has two modes of execution they are, 1. Local mode: Pig scripts runs on a single machine were Hadoop Map-Reduce and HDFS are not required. 2. Hadoop or Map-Reduce mode: pig scripts runs on a Hadoop. Pig statements can be executed in three different ways, in which all the three are executed in both local and Hadoop modes. 1. Grunt shell: Allowsusersto type pig commandsmanually using pig’s interactive shell, grunt. 2. Script file: Place pig commandsin a script file andrunthe script file. 3. Embedded: Embed the pig statements in a host language and run the statement. 5.3 JSOUP Jsoup is a java html parser. It is a java library that is used to parse HTML document. Jsoup provides API to scrap and change data from URL or HTML file. It uses DOM, CSS and Jquery-like methods for scrapping and manipulating file. J soup: parsing string Syntax: Document d = Jsoup.parse(html); Where  Document - document object represents theHTML DOM.  Jsoup - main class to parse the given HTML String.  Html - HTML String. Parsing data from Holidiffy.com 6. FLOWCHART Fig. 6 Flowchart
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 05 Issue: 04 | Apr-2018 www.irjet.net p-ISSN: 2395-0072 © 2018, IRJET | Impact Factor value: 6.171 | ISO 9001:2008 Certified Journal | Page 1687 7. CONCLUSION This paper gives an overview of implementation of Tourism Recommender System which gives the personalize suggestion to the tourist for selecting best location which is suitable for their profile. This system is implemented using hadoop technology so that processing on huge amount of dataset becomes easy. For implementing mapping and de- duplication Pig Latin script is used. Pig Latin is an extension of map-reduce. Map-reduce functionality of hadoop is implemented using pig. With the help of pig programmers does not have to write lines of code, simply they type command in grunt shell toreduce programming complexity. With the help of pig, recommender systemissimpleandeasy to understand. It also improves the execution speed of recommender system. 8. REFERENCES [1] Bhosale, Harshawardhan S., and Devendra P. Gadekar."A Review Paper on Big Data and Hadoop." International Journal of Scientific and Research Publications 4.10 (2014) [2] Chavan, Ms Vibhavari, and Rajesh N. Phursule. "Survey paper on big data." Int. J. Comput. Sci. Inf. Technol 5.6 (2014): 7932-7939. [3] O. Benjelloun, H. Garcia-Molina, D. Menestrina, Q. Su, S.E. Whang, and J. Widom. Swoosh: a generic approach to entity resolution. The VLDB Journal, March 2008. VLDB Journal (Online First) link: http://dx.doi.org/10.1007/s00778-008-0098-x. [4] M. Bilenko. Learning to combine trained distancemetrics for duplicate detection in databases. Submitted to CIKM- 2002, (February):1–19, 2002. [5] Adomavicius, G & Tuzhilin, A (2005), Toward the next generation of recommender systems: A survey of the state- of-the-art and possible extensions, IEEE Transactions on Knowledge and Data Engineering, vol. 17, no. 6, pp.734-749. [6] J. L. Herlocker, J. A. Konstan, L. G. Terveen, & J. T. Riedl, Evaluating collaborative filtering recommender systems, ACM Transactions on Information Systems, vol. 22, no. 1,pp. 5-53, 2004.