IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Certain Issues in Web Page Prediction, Classification and Clustering in Data ... - IJAEMSJORNAL
Web mining, which is a part of data mining, now plays a vital role in applications such as search engines, health care (extracting individual patient details from huge databases and analyzing diseases against basic criteria), education (comparing performance levels across systems), social networking, e-commerce and knowledge management, all of which extract information based on the user query. The open issues are the time taken to mine the target content or web page from search engines, space complexity, and predicting the frequently visited web page for the next user based on users’ behaviour.
An Enhanced Approach for Detecting User's Behavior Applying Country-Wise Loca... - IJSRD
The growth of the web in the past few years has created many challenges in this field. Recent work searches data using a tree-based search pattern. Various sequential mining algorithms have been developed to date. Web usage mining operates on web server logs, which contain the navigation history of the user. The recommender system is explained along with its whole procedure. The search results should lead to a proper and efficient search, but the problem was the time taken and the quality of the search results generated. So, a new local search algorithm is proposed for country-wise search that makes searching more efficient on a local-results basis. This approach has led to an advancement in search-based methods and the results they generate.
MULTIFACTOR NAÏVE BAYES CLASSIFICATION FOR THE SLOW LEARNER PREDICTION OVER M... - ijcsa
High school students should be observed for slow or quick learning abilities so that they can be provided
with the best education practices. Such analysis can be performed over student
performance data. The high school student data was obtained from schools in various
regions of Punjab, a pivotal state of India. The complete student data and a selected set of almost 1300
students from one school in these regions were tested using the model proposed in
this paper. The proposed model is based on the naïve Bayes classification model and classifies the data
using multi-factor features obtained from the input dataset. The subjects have been
divided into two primary groups: difficult and normal. The classification algorithm has been applied
individually over the data in each subject group. Both early-stage classification events
produced almost similar results, whereas the results obtained from the classification events over
the averaging factors and the floating factors told a different story than the early-stage classification. The
results show that deep analysis of the data reveals in-depth facts about the input
data. The proposed model can be considered as the effectiv
AN INTELLIGENT OPTIMAL GENETIC MODEL TO INVESTIGATE THE USER USAGE BEHAVIOUR ... - ijdkp
The unexpectedly widespread use of the WWW and the dynamically growing nature of the web create new
challenges in web mining, since web data is inherently unlabelled, incomplete, non-linear and
heterogeneous. Investigating user usage behaviour on the WWW is a real-time problem that involves
multiple conflicting measures of performance. These measures not only make the problem computationally intensive but
also raise the possibility that an exact solution cannot be found. Unfortunately, conventional methods
are limited for such optimization problems due to the absence of semantic certainty and the presence of human
intervention. To handle such data and overcome the limitations of conventional methodologies, it is
necessary to use a soft computing model that can work intelligently to attain an optimal solution.
Identifying the Number of Visitors to improve Website Usability from Educatio... - Editor IJCATR
Web usage mining deals with understanding visitors’ behaviour on a website. It helps in understanding concerns
such as the present and future probability of every website user and the relationship between behaviour and website usability. Web mining has different
branches: web content mining, web structure mining and web usage mining. The focus of this paper is on mining usage patterns from
the web log data of an educational institution. There are three types of web-related log data, namely web access log, error log and proxy log
data. In this paper web access log data has been used as the dataset because it is the typical source of the navigational
behaviour of website visitors. The study of web server log analysis is helpful in applying web mining techniques.
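As a rough illustration of what working with a web access log can look like, the following minimal sketch parses lines in the Common Log Format and counts distinct visitors and successful page hits. It is not the method of the paper above: the sample lines, the names `parse_line` and `summarize`, and the choice to treat each client address as one visitor are all assumptions made for this example.

```python
import re
from collections import Counter

# Common Log Format: host ident authuser [date] "request" status bytes
CLF_PATTERN = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) (?P<size>\S+)'
)


def parse_line(line):
    """Return the fields of one access-log line as a dict, or None if malformed."""
    match = CLF_PATTERN.match(line)
    return match.groupdict() if match else None


def summarize(lines):
    """Count distinct client hosts and successful (2xx) hits per path."""
    visitors = set()
    page_hits = Counter()
    for line in lines:
        record = parse_line(line)
        if record is None:
            continue  # skip lines that are not valid log entries
        # Assumption: one client address is treated as one visitor.
        visitors.add(record["host"])
        if record["status"].startswith("2"):
            page_hits[record["path"]] += 1
    return len(visitors), page_hits


# Hypothetical sample data, invented for illustration only.
SAMPLE_LOG = [
    '10.0.0.1 - - [10/Oct/2015:13:55:36 +0530] "GET /index.html HTTP/1.1" 200 2326',
    '10.0.0.2 - - [10/Oct/2015:13:56:01 +0530] "GET /courses.html HTTP/1.1" 200 5120',
    '10.0.0.1 - - [10/Oct/2015:13:57:12 +0530] "GET /courses.html HTTP/1.1" 200 5120',
    '10.0.0.3 - - [10/Oct/2015:13:58:40 +0530] "GET /missing.html HTTP/1.1" 404 512',
    'not a log line',
]

if __name__ == "__main__":
    visitor_count, hits = summarize(SAMPLE_LOG)
    print(visitor_count, hits.most_common())
```

Counting by client address overestimates or underestimates real visitors when proxies or shared machines are involved, so a real study would likely add session identification on top of this.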
International Journal of Engineering Research and Development - IJERD Editor
Electrical, Electronics and Computer Engineering,
Information Engineering and Technology,
Mechanical, Industrial and Manufacturing Engineering,
Automation and Mechatronics Engineering,
Material and Chemical Engineering,
Civil and Architecture Engineering,
Biotechnology and Bio Engineering,
Environmental Engineering,
Petroleum and Mining Engineering,
Marine and Agriculture Engineering,
Aerospace Engineering.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
IJRET : International Journal of Research in Engineering and Technology Improv... - eSAT Publishing House
BIDIRECTIONAL GROWTH BASED MINING AND CYCLIC BEHAVIOUR ANALYSIS OF WEB SEQUEN... - ijdkp
Web sequential patterns are important for analyzing and understanding users’ behaviour to improve the
quality of service offered by the World Wide Web. Web prefetching is one such technique; it utilizes
prefetching rules derived through Cyclic Model Analysis of the mined Web sequential patterns. Prediction is more
accurate and the results of prefetching more satisfying if we use a highly efficient and
scalable mining technique such as the Bidirectional Growth based Directed Acyclic Graph. In this paper,
we propose a novel algorithm called Bidirectional Growth based mining Cyclic behavior Analysis of web
sequential Patterns (BGCAP) that effectively combines these strategies to generate prefetching rules in the
form of 2-sequence patterns with Periodicity and threshold of Cyclic Behaviour that can be utilized to
effectively prefetch Web pages, thus reducing the users’ perceived latency. As BGCAP is based on
bidirectional pattern growth, it performs only (log n+1) levels of recursion for mining n Web sequential
patterns. Our experimental results show that generating prefetching rules using BGCAP is 5-10% faster for
different data sizes and 10-15% faster for a fixed data size than with TD-Mine. In addition, BGCAP generates
about 5-15% more prefetching rules than TD-Mine.
IRJET-A Survey on Web Personalization of Web Usage Mining - IRJET Journal
S. Jagan, Dr. S. P. Rajagopalan, "A Survey on Web Personalization of Web Usage Mining", International Research Journal of Engineering and Technology (IRJET), Volume 2, Issue 01, Mar-2015. e-ISSN: 2395-0056, p-ISSN: 2395-0072. www.irjet.net, published by Fast Track Publications
Abstract
Nowadays the World Wide Web (WWW) is a rich and powerful source of information. Day by day it becomes more complex and grows in size as more information goes online. As a result, retrieving exactly the information its users expect is becoming a more complex and critical task. One powerful concept for dealing with this problem is personalization, which is becoming increasingly important. Personalization is a subclass of information filtering systems that seek to predict the 'ratings' or 'preferences' that a user would give to items they have not yet considered, using a model built from the characteristics of an item (content-based approaches or collaborative filtering approaches). Web mining is an emerging field of data mining used to provide personalization on the web. It consists of three major categories, i.e. Web Content Mining, Web Usage Mining, and Web Structure Mining. This paper focuses on web usage mining and the algorithms used for providing personalization on the web.
Comparable Analysis of Web Mining Categories - theijes
Web Data Mining is a current field of analysis that combines two research areas, Data Mining and the World Wide Web. Web Data Mining research is associated with various research disciplines such as Databases, Artificial Intelligence and Information Retrieval. The mining techniques are grouped into three categories, namely Web Content Mining, Web Structure Mining and Web Usage Mining. In this work, an analysis of these mining techniques is carried out. From the analysis it has been concluded that Web Content Mining has an unstructured or semi-structured view of data, whereas Web Structure Mining has a linked structure and Web Usage Mining mainly covers interaction.
A survey on ontology based web personalization - eSAT Journals
Abstract: Over the last decade the data on the World Wide Web has been growing in an exponential manner. According to Google the data is accelerating with a speed of billion pages per day [24]. The Internet has around 2 million users accessing the World Wide Web for various information [25]. These numbers certainly raise a severe concern over information overload challenges for the users. Many researchers have been working to overcome this challenge with web personalization, and many are looking at ontology based web personalization as an answer to information overload, as each individual is unique. In this paper we present an overview of ontology based web personalization, its challenges and a survey of the work. This paper also points to future work in web personalization. Index Terms: Web Personalization, Ontology, User modeling, web usage mining.
Advance Clustering Technique Based on Markov Chain for Predicting Next User M... - idescitation
According to the survey, India is one of the
leading countries in the world for technical education and
management education. The number of students is increasing
day by day at a growth rate of 45% per annum. Advancement
in technology has a special effect on the education system and
helps in upgrading higher education. Some universities and
colleges are using these technologies; the weblog is one of them.
The main aim of this paper is to represent web logs using a clustering
technique for predicting next user movement and for user
behavior analysis. The paper revolves around a web log
clustering technique based on Markov chain results. In this
paper we present an ideal approach to web clustering
(clustering web site users) and predicting their behavior for
the next visit. Methodology: To generate effective results, web usage
data from approximately 14 engineering colleges is used, and an advance
clustering approach is presented after optimizing the other
clustering approaches. Results: The user behavior is predicted
with the help of the advance clustering approach based on
FPCM and k-means. The proposed algorithm is used to mine and
predict users’ preferred paths. Existing approaches have been used
to predict user behavior, but they
are not sufficient because of their sensitivity to
noise. With the help of ACM, noise is reduced, which provides
more accurate results for predicting user behavior. Approach
Implementation: The algorithm was implemented in MATLAB,
DTRG and Java. The experimental results show that
this method is very effective in predicting user behavior and
have validated the method’s effectiveness
in comparison with some previous studies.
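The general idea of predicting a user's next movement from a Markov chain can be sketched as follows. This is only a plain first-order Markov model over page sequences, without the clustering (FPCM, k-means) or noise handling the abstract describes; the session data and the names `build_transitions` and `predict_next` are hypothetical and chosen for this example.

```python
from collections import Counter, defaultdict


def build_transitions(sessions):
    """Count page-to-page transitions observed in a list of sessions."""
    counts = defaultdict(Counter)
    for session in sessions:
        # Pair each page with the page that was visited right after it.
        for current, following in zip(session, session[1:]):
            counts[current][following] += 1
    return counts


def predict_next(counts, page):
    """Return (most likely next page, estimated probability), or None if unseen."""
    followers = counts.get(page)
    if not followers:
        return None
    total = sum(followers.values())
    best_page, best_count = followers.most_common(1)[0]
    return best_page, best_count / total


# Hypothetical navigation sessions, invented for illustration only.
SESSIONS = [
    ["home", "courses", "admissions"],
    ["home", "courses", "faculty"],
    ["home", "courses", "admissions", "contact"],
    ["home", "contact"],
]

if __name__ == "__main__":
    transitions = build_transitions(SESSIONS)
    print(predict_next(transitions, "home"))
    print(predict_next(transitions, "courses"))
```

A first-order model only looks at the current page; approaches like the one summarized above typically group similar users first so that each group gets its own, less noisy transition statistics.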
The web has become a resourceful tool for almost all domains today. Search engines prominently use
the inverted indexing technique to locate the web pages containing the user's query. The performance of an inverted
index fundamentally depends on searching for the keyword in the list maintained by the search engine. Text
matching is done with the help of a string matching algorithm. It is important for any string matching
algorithm to quickly locate the occurrences of the user-specified pattern in a large text. In this paper a new
string matching algorithm for keyword searching is proposed. The proposed algorithm relies on a new
technique based on pattern length and FML (First-Middle-Last) character match. The proposed
algorithm is analysed and implemented. Extensive testing and comparisons are done with Boyer-Moore, Naïve, Improved Naïve, Horspool and Zhu-Takaoka. The results show that the proposed
algorithm takes less time than the other existing algorithms.
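One plausible reading of a First-Middle-Last check is a cheap pre-filter: before comparing a whole window of text with the pattern, compare only its first, middle and last characters and skip the window if any of them differ. The sketch below implements that reading on top of a simple left-to-right scan. The abstract does not give the algorithm's details, so this is an assumption for illustration and not the proposed algorithm itself; `fml_search` is a hypothetical name.

```python
def fml_search(text, pattern):
    """Return the start indices of all occurrences of pattern in text.

    Each window is first screened on its first, middle and last characters;
    the full comparison runs only when all three match.
    """
    n, m = len(text), len(pattern)
    if m == 0 or m > n:
        return []
    mid = m // 2
    first, middle, last = pattern[0], pattern[mid], pattern[-1]
    hits = []
    for i in range(n - m + 1):
        # Cheap three-character screen (assumed meaning of "FML").
        if text[i] != first or text[i + m - 1] != last or text[i + mid] != middle:
            continue
        # Full comparison only for windows that pass the screen.
        if text[i:i + m] == pattern:
            hits.append(i)
    return hits


if __name__ == "__main__":
    print(fml_search("web usage mining and web mining", "mining"))
```

This version still advances one position at a time, so any speed-up comes only from avoiding full comparisons; it does not use the pattern length to skip ahead as the abstract suggests the real algorithm may.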
Classification-based Retrieval Methods to Enhance Information Discovery on th... - IJMIT JOURNAL
The widespread adoption of the World-Wide Web (the Web) has created challenges both for society as a whole and for the technology used to build and maintain the Web. The ongoing struggle of information retrieval systems is to wade through this vast pile of data and satisfy users by presenting them with information that most adequately fits their needs. On a societal level, the Web is expanding faster than we can comprehend its implications or develop rules for its use. The ubiquitous use of the Web has raised important social concerns in the areas of privacy, censorship, and access to information. On a technical level, the novelty of the Web and the pace of its growth have created challenges not only in the development of new applications that realize the power of the Web, but also in the technology needed to scale applications to accommodate the resulting large data sets and heavy loads. This thesis presents searching algorithms and hierarchical classification techniques for increasing a search service's understanding of web queries. Existing search services rely solely on a query's occurrence in the document collection to locate relevant documents. They typically do not perform any task or topic-based analysis of queries using other available resources, and do not leverage changes in user query patterns over time. Provided within is a set of techniques and metrics for performing temporal analysis on query logs. Our log analyses are shown to be reasonable and informative, and can be used to detect changing trends and patterns in the query stream, thus providing valuable data to a search service.
A Survey on: Utilizing of Different Features in Web Behavior Prediction - Editor IJMTER
As the number of web users increases day by day, many websites have a large
number of visitors at the same instant. Handling these users requires different techniques. Among
these requirements, one emerging field is next page prediction, where different features are studied according to the user's navigation
pattern in order to predict the next page for the user. This reduces the overall web
server response time. This paper presents a detailed study of different research papers,
their techniques and outcomes, and the features they utilize, such as web structure, web log and web
content.
Load balancing with switching mechanism in cloud computing environment - eSAT Publishing House
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
IJRET : International Journal of Research in Engineering and TechnologyImprov...eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
BIDIRECTIONAL GROWTH BASED MINING AND CYCLIC BEHAVIOUR ANALYSIS OF WEB SEQUEN...ijdkp
Web sequential patterns are important for analyzing and understanding users’ behaviour to improve the
quality of service offered by the World Wide Web. Web Prefetching is one such technique that utilizes
prefetching rules derived through Cyclic Model Analysis of the mined Web sequential patterns. The more
accurate the prediction and more satisfying the results of prefetching if we use a highly efficient and
scalable mining technique such as the Bidirectional Growth based Directed Acyclic Graph. In this paper,
we propose a novel algorithm called Bidirectional Growth based mining Cyclic behavior Analysis of web
sequential Patterns (BGCAP) that effectively combines these strategies to generate prefetching rules in the
form of 2-sequence patterns with Periodicity and threshold of Cyclic Behaviour that can be utilized to
effectively prefetch Web pages, thus reducing the users’ perceived latency. As BGCAP is based on
Bidirectional pattern growth, it performs only (log n+1) levels of recursion for mining n Web sequential
patterns. Our experimental results show that prefetching rules generated using BGCAP is 5-10% faster for
different data sizes and 10-15% faster for a fixed data size than TD-Mine. In addition, BGCAP generates
about 5-15% more prefetching rules than TD-Mine.
IRJET-A Survey on Web Personalization of Web Usage MiningIRJET Journal
S.Jagan, Dr.S.P.Rajagopalan "A Survey on Web Personalization of Web Usage Mining", International Research Journal of Engineering and Technology (IRJET),Volume 2,issue-01 Mar-2015. e-ISSN:2395-0056, p-ISSN:2395-0072. www.irjet.net , published by Fast Track Publications
Abstract
Now a day, World Wide Web (www) is a rich and most powerful source of information. Day by day it is becoming more complex and expanding in size to get maximum information details online. However, it is becoming more complex and critical task to retrieve exact information expected by its users. To deal with this problem one more powerful concept is personalization which is becoming more powerful now days. Personalization is a subclass of information filtering system that seek to predict the 'ratings' or 'preferences' that a user would give to an items, they had not yet considered, using a model built from the characteristics of an item (content-based approaches or collaborative filtering approaches). Web mining is an emerging field of data mining used to provide personalization on the web. It consist three major categories i.e. Web Content Mining, Web Usage Mining, and Web Structure Mining. This paper focuses on web usage mining and algorithms used for providing personalization on the web.
Comparable Analysis of Web Mining Categoriestheijes
Web Data Mining is the current field of analysis which is a combination of two research area known as Data Mining and World Wide Web. Web Data Mining research associates with various research diversities like Database, Artificial Intelligence and Information redeem. The mining techniques are categorized into various categories namely Web Content Mining, Web Structure Mining and Web Usage Mining. In this work, analysis of mining techniques are done. From the analysis it has been concluded that Web Content Mining has unstructured or semi- structure view of data whereas Web Structure Mining have linked structure and Web Usage Mining mainly includes interaction.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
A survey on ontology based web personalizationeSAT Journals
Abstract Over the last decade the data on World Wide Web has been growing in an exponential manner. According to Google the data is accelerating with a speed of billion pages per day [24]. Internet has around 2 million users accessing the World Wide Web for various information [25].These numbers certainly raise a severe concern over information over load challenges for the users. Many researchers have been working to overcome the challenge with web personalization, many researchers are looking at ontology based web personalization as an answer to the information overload, as each individual is unique. In this paper we present an overview of ontology based web personalization, Challenges and a survey of the work. This paper also points future work in web personalization. Index Terms: Web Personalization, Ontology, User modeling, web usage mining.
Advance Clustering Technique Based on Markov Chain for Predicting Next User M... idescitation
According to the survey, India is one of the leading countries in the world for technical education and management education. The number of students is increasing day by day, at a growth rate of 45% per annum. Advancement in technology has a marked effect on the education system and helps in upgrading higher education. Some universities and colleges are using these technologies; the weblog is one of them. The main aim of this paper is to represent web logs using a clustering technique for predicting next user movement and for user behavior analysis. The paper centers on a web log clustering technique based on Markov chain results. We present an approach to web clustering (clustering web site users) and to predicting their behavior for the next visit. Methodology: To generate effective results, web usage data from approximately 14 engineering colleges is used, and an advanced clustering approach is presented after optimizing the other clustering approaches. Results: User behavior is predicted with the help of the advanced clustering approach based on FPCM and k-means. The proposed algorithm is used to mine and predict users' preferred paths. Existing approaches have been used to predict user behavior, but they are not sufficient because of their sensitivity to noise. With the help of ACM, noise is reduced, which gives a more accurate prediction of user behavior. Approach Implementation: The algorithm was implemented in MATLAB, DTRG and Java. The experimental results show that this method is very effective in predicting user behavior, and they have validated the method's effectiveness in comparison with some previous studies.
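The Markov chain idea behind next-movement prediction can be illustrated with a minimal sketch. It is not the paper's algorithm: the clustering stage (FPCM, k-means) is left out, and the function names, page names, and sessions below are hypothetical. It assumes a first-order model in which the next page depends only on the current page, estimated from transition counts in logged sessions.

```python
from collections import Counter, defaultdict


def build_transitions(sessions):
    """Count page-to-page transitions across all logged sessions."""
    counts = defaultdict(Counter)
    for session in sessions:
        # Pair each page with the page visited immediately after it.
        for current, following in zip(session, session[1:]):
            counts[current][following] += 1
    return counts


def predict_next(counts, page):
    """Return (most likely next page, estimated probability), or None if the page was never left."""
    followers = counts.get(page)
    if not followers:
        return None
    best, hits = followers.most_common(1)[0]
    return best, hits / sum(followers.values())


# Hypothetical sessions, each an ordered list of pages visited by one user.
sessions = [
    ["home", "courses", "admissions"],
    ["home", "courses", "faculty"],
    ["home", "courses", "admissions"],
    ["home", "contact"],
]
counts = build_transitions(sessions)
prediction = predict_next(counts, "courses")
print(prediction)
```

In a clustered setting, one such transition table could be built per user cluster, so that a visitor is matched to a cluster first and the prediction is drawn from that cluster's table.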
The web has become a resourceful tool for almost all domains today. Search engines prominently use the inverted indexing technique to locate the web pages containing the user's query. The performance of an inverted index fundamentally depends on searching for the keyword in the list maintained by the search engine. Text matching is done with the help of a string matching algorithm. It is important for any string matching algorithm to quickly locate the occurrences of the user-specified pattern in a large text. In this paper a new string matching algorithm for keyword searching is proposed. The proposed algorithm relies on a new technique based on pattern length and FML (First-Middle-Last) character match. The proposed algorithm is analysed and implemented. Extensive testing and comparisons are done with Boyer-Moore, Naïve, Improved Naïve, Horspool and Zhu-Takaoka. The results show that the proposed algorithm takes less time than the other existing algorithms.
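The abstract does not spell out the algorithm, so the following is only a sketch of one plausible reading of a First-Middle-Last check: before comparing a whole window of text against the pattern, compare just the first, middle, and last characters and skip the window if any of them differs. The function name `fml_search` and the choice of `len(pattern) // 2` as the middle position are assumptions, and the sketch makes no claim about the shifts or speedups reported in the paper.

```python
def fml_search(text, pattern):
    """Return the start index of every occurrence of pattern in text."""
    n, m = len(text), len(pattern)
    if m == 0 or m > n:
        return []
    mid = m // 2  # assumed position of the "middle" character
    first, middle, last = pattern[0], pattern[mid], pattern[-1]
    matches = []
    for i in range(n - m + 1):
        # Cheap filter: reject the window unless all three probe characters agree.
        if text[i] != first or text[i + m - 1] != last or text[i + mid] != middle:
            continue
        # Full comparison only for windows that pass the filter.
        if text[i:i + m] == pattern:
            matches.append(i)
    return matches


print(fml_search("abracadabra", "abra"))
```

The filter costs at most three character comparisons per window, so windows that differ at either end or in the middle are discarded without a full comparison.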
Classification-based Retrieval Methods to Enhance Information Discovery on th... IJMIT JOURNAL
The widespread adoption of the World-Wide Web (the Web) has created challenges both for society as a whole and for the technology used to build and maintain the Web. The ongoing struggle of information retrieval systems is to wade through this vast pile of data and satisfy users by presenting them with information that most adequately fits their needs. On a societal level, the Web is expanding faster than we can comprehend its implications or develop rules for its use. The ubiquitous use of the Web has raised important social concerns in the areas of privacy, censorship, and access to information. On a technical level, the novelty of the Web and the pace of its growth have created challenges not only in the development of new applications that realize the power of the Web, but also in the technology needed to scale applications to accommodate the resulting large data sets and heavy loads. This thesis presents searching algorithms and hierarchical classification techniques for increasing a search service's understanding of web queries. Existing search services rely solely on a query's occurrence in the document collection to locate relevant documents. They typically do not perform any task or topic-based analysis of queries using other available resources, and do not leverage changes in user query patterns over time. Provided within is a set of techniques and metrics for performing temporal analysis on query logs. Our log analyses are shown to be reasonable and informative, and can be used to detect changing trends and patterns in the query stream, thus providing valuable data to a search service.
A Survey on: Utilizing of Different Features in Web Behavior Prediction Editor IJMTER
As the number of web users increases day by day, many websites have a large number of visitors at the same instant, so handling these users requires different techniques. One emerging field that addresses this requirement is next page prediction, in which different features are studied according to the user's navigation pattern in order to predict the next page for the user. This reduces the overall web server response time. This paper presents a detailed study of different research papers, their techniques and outcomes, and a list of the features they use, such as web structure, web log, and web content.
Load balancing with switching mechanism in cloud computing environment eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Identifying e learner’s opinion using automated sentiment analysis in e-learning eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Automatic collision detection for an autonomous robot using proximity sensing... eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Microstrip circular patch array antenna for electronic toll collection eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology.
Load balancing in public cloud by division of cloud based on the geographical... eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Effect of bacteria on partial replacement of concrete with fly ash and ggbs eSAT Publishing House
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
Semantically enriched web usage mining for predicting user future movements IJwest
Explosive and quick growth of the World Wide Web has resulted in intricate Web sites, demanding enhanced user skills and sophisticated tools to help the Web user to find the desired information. Finding desired information on the Web has become a critical ingredient of everyday personal, educational, and business life. Thus, there is a demand for more sophisticated tools to help the user to navigate a Web site and find the desired information. The users must be provided with information and services specific to their needs, rather than an undifferentiated mass of information.
Many Web usage mining techniques have been applied to discover interesting and frequent navigation patterns from Web server logs. The recommendation accuracy of solely usage based techniques can be improved by integrating Web site content and site structure in the personalization process.
Herein, we propose the Semantically enriched Web Usage Mining method (SWUM), which combines the fields of Web Usage Mining and Semantic Web. In the proposed method, the undirected graph derived from usage data is enriched with rich semantic information extracted from the Web pages and the Web site structure. The experimental results show that SWUM generates accurate recommendations with the integration of usage, semantic data and Web site structure. The results show that the proposed method is able to achieve 10-20% better accuracy than the solely usage based model, and 5-8% better than an ontology based model.
An effective search on web log from most popular downloaded content ijdpsjournal
A Web page recommender system effectively predicts the best related web page to search. When searching for a word in a search engine, the engine may display unnecessary links and unrelated data to the user. To avoid this problem, the conceptual prediction model combines both web usage and domain knowledge. The proposed conceptual prediction model automatically generates a semantic network of the semantic Web usage knowledge, which is the integration of domain knowledge and web usage information. Web usage mining aims to discover interesting and frequent user access patterns from web browsing data. The discovered knowledge can then be used for many practical web applications such as web recommendations, adaptive web sites, and personalized web search and surfing.
International conference On Computer Science And technologyanchalsinghdm
ICGCET 2019 | 5th International Conference on Green Computing and Engineering Technologies. The conference will be held on 7th September - 9th September 2019 in Morocco. International Conference On Engineering Technology
The conference aims to promote the work of researchers, scientists, engineers and students from across the world on advancement in electronic and computer systems.
Web Page Recommendation Using Web Mining IJERA Editor
On the World Wide Web, various kinds of content are generated in huge amounts, so giving relevant results to the user through web recommendation has become an important part of web applications. Different kinds of web recommendations are made available to users every day, including image, video, audio, query suggestion and web page recommendations. In this paper we aim to provide a framework for web page recommendation. 1) First we describe the basics of web mining and the types of web mining. 2) We give details of each web mining technique. 3) We propose the architecture for personalized web page recommendation.
Web Usage Mining: A Survey on User's Navigation Pattern from Web Logs ijsrd.com
With the exponential growth of the World Wide Web, users are overloaded with information and it has become hard to find data according to need. Web usage mining is a part of web mining that deals with the automatic discovery of user navigation patterns from web logs. This paper presents an overview of web mining and also derives navigation patterns using classification and clustering algorithms for web usage mining. Web usage mining comprises three important tasks, namely data preprocessing, pattern discovery, and pattern analysis based on the discovered patterns. The paper also contains a comparative study of web mining techniques.
MULTIFACTOR NAÏVE BAYES CLASSIFICATION FOR THE SLOW LEARNER PREDICTION OVER M... ijcsa
High school students must be observed for their slow or quick learning abilities in order to provide them with the best education practices. Such analysis can be performed over student performance data. The high school student data has been obtained from schools in various regions of Punjab, a pivotal state of India. The complete student data and the selective data of almost 1300 students obtained from one school in the region have been tested using the model proposed in this paper. The proposed model is based upon the naïve Bayes classification model, classifying the data using multi-factor features obtained from the input dataset. The subject groups have been divided into two primary groups: difficult and normal. The classification algorithm has been applied individually over the data grouped into the various subject groups. Both of the early stage classification events have produced almost similar results, whereas the results obtained from the classification events over the averaging factors and the floating factors told a different story than the early stage classification. The proposed model results have shown that deep analysis of the data reveals in-depth facts from the input data. The proposed model can be considered an effective classification model when evaluated from the results described in the earlier sections.
A NEW IMPROVED WEIGHTED ASSOCIATION RULE MINING WITH DYNAMIC PROGRAMMING APPR... cscpconf
With the rapid development of the Internet, Web search has taken on an important role in our ordinary life. In web search, mining frequent patterns in large databases is a major research area. Due to the increase in user activity on the web, web-searching methods that predict the next request of a user visiting web pages play a major role. Web searching methods help to provide quality results and timely answers, and also offer customized navigation. In web search, association rule mining is an important data analysis method for discovering associated web pages. Most researchers have implemented association mining using the Apriori algorithm with binary representation. The problem with this approach is that it does not address issues such as the navigation order of web pages. To overcome this problem, researchers proposed a weighted Apriori that maintains navigation order but is unable to produce optimal results. With the goal of obtaining the most favorable result, we propose a novel approach which combines weighted Apriori and dynamic programming. The experimental results show that this approach maintains the navigation order of web pages and achieves the best solution. The proposed technique enhances web site effectiveness, increases the user's browsing knowledge, improves prediction accuracy and decreases computational complexity.
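The limitation of the binary representation can be made concrete with a small sketch. It computes the standard support and confidence of a rule between two pages while ignoring visit order, then an order-aware confidence for comparison. It does not implement the weighted Apriori or the dynamic programming step of the proposed approach; `ordered_confidence` and the sample sessions are hypothetical, introduced only for illustration.

```python
def support(sessions, pages):
    """Fraction of sessions that contain every page in pages, ignoring order."""
    wanted = set(pages)
    return sum(wanted <= set(session) for session in sessions) / len(sessions)


def confidence(sessions, antecedent, consequent):
    """Binary-representation confidence of the rule antecedent -> consequent."""
    base = support(sessions, [antecedent])
    if base == 0:
        return 0.0
    return support(sessions, [antecedent, consequent]) / base


def ordered_confidence(sessions, first, then):
    """Among sessions containing `first`, the fraction in which `then` is visited after it."""
    containing = [s for s in sessions if first in s]
    if not containing:
        return 0.0
    hits = sum(then in s[s.index(first) + 1:] for s in containing)
    return hits / len(containing)


# Hypothetical sessions: each list is the ordered sequence of pages in one visit.
sessions = [["A", "B", "C"], ["B", "A", "C"], ["A", "C"], ["B", "C"]]
print(confidence(sessions, "A", "B"))          # order ignored
print(ordered_confidence(sessions, "A", "B"))  # order respected
```

On this sample the unordered confidence of A -> B counts both sessions that contain A and B, while the ordered variant counts only the session in which B actually follows A, which is the distinction that motivates order-preserving approaches.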
CONTENT AND USER CLICK BASED PAGE RANKING FOR IMPROVED WEB INFORMATION RETRIEVAL ijcsa
Search engines today are retrieving more than a few thousand web pages for a single query, most of which
are irrelevant. Listing results according to user needs is, therefore, a very real necessity. The challenge lies
in ordering retrieved pages and presenting them to users in line with their interests. Search engines,
therefore, utilize page rank algorithms to analyze and re-rank search results according to the relevance of
the user’s query by estimating (over the web) the importance of a web page. The proposed work
investigates web page ranking methods and recently-developed improvements in web page ranking.
Further, a new content-based web page rank technique is also proposed for implementation. The proposed
technique finds out how important a particular web page is by evaluating the data a user has clicked on, as
well as the contents available on these web pages. The results demonstrate the effectiveness of the proposed
page ranking technique and its efficiency.
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ... inventionjournals
Information is overloaded on the Internet due to the unstable growth of information, which makes information search a complicated process. The Recommendation System (RS) is a tool widely used nowadays in many areas to suggest items of interest to users. With the development of e-commerce and information access, recommender systems have become a popular technique to prune large information spaces so that users are directed toward those items that best meet their needs and preferences. With the exponential explosion of content generated on the Web, recommendation techniques have become increasingly indispensable. Web recommendation systems help users get the exact information they need and make information search easier. Web recommendation is one of the techniques of web personalization, which recommends web pages or items to the user based on previous browsing history. But the tremendous growth in the amount of available information and the number of visitors to web sites in recent years poses some key challenges for recommender systems. Recent recommender systems struggle with producing high quality recommendations from large amounts of information, returning unwanted items instead of the targeted item or product, and performing many recommendations per second for millions of users and items. To meet these challenges, new recommender system technologies are needed that can quickly produce high quality recommendations, even for very large scale problems. To address these issues we use two recommender system processes, based on fuzzy clustering and collaborative filtering algorithms. Fuzzy clustering is used to predict the items or products that will be accessed in the future based on the user's previous browsing behavior. The collaborative filtering recommendation process is used to produce the result the user expects from the output of fuzzy clustering and the collection of Web database data items.
Using this new recommendation system, the user obtains the expected product or item in minimum time. The system reduces the number of unrelated and unwanted items shown to the user and provides results within the user's domain of interest.
In this world of information technology, everyone tends to do business electronically. Today a lot of business takes place on the World Wide Web (WWW), so it is very important for website owners to provide a better platform to attract more customers to their sites. Providing information in a better way is the solution for bringing in more customers or users. The customer is the end user, whose access to the information yields some credit to the web site owners. In this paper we define web mining and present a method to utilize web mining in a better way to understand user and website behaviour, which in turn helps enhance the web site information to attract more users. This paper also presents an overview of the various research done on pattern extraction and web content mining, and how it can serve as a catalyst for E-business.
A Web Extraction Using Soft Algorithm for Trinity Structureiosrjce
IOSR Journal of Computer Engineering (IOSR-JCE) is a double blind peer reviewed International Journal that provides rapid publication (within a month) of articles in all areas of computer engineering and its applications. The journal welcomes publications of high quality papers on theoretical developments and practical applications in computer technology. Original research papers, state-of-the-art reviews, and high quality technical notes are invited for publications.
Welcome to WIPAC Monthly the magazine brought to you by the LinkedIn Group Water Industry Process Automation & Control.
In this month's edition, along with this month's industry news and to celebrate the 13 years since the group was created, we have articles including:
A case study of the use of Advanced Process Control at the wastewater treatment works at Lleida in Spain.
A look back on an article on smart wastewater networks, to see how the industry has measured up in the interim in adopting Digital Transformation in the Water Industry.
Saudi Arabia stands as a titan in the global energy landscape, renowned for its abundant oil and gas resources. It's the largest exporter of petroleum and holds some of the world's most significant reserves. Let's delve into the top 10 oil and gas projects shaping Saudi Arabia's energy future in 2024.
Final project report on grocery store management system..pdf Kamal Acharya
In today’s fast-changing business environment, it’s extremely important to be able to respond to client needs in the most effective and timely manner, especially if your customers wish to see your business online and have instant access to your products or services.
Online Grocery Store is an e-commerce website which retails various grocery products. This project allows users to view the various products available and enables registered users to purchase desired products instantly using the Paytm or UPI payment processor (Instant Pay), or to place an order using the Cash on Delivery (Pay Later) option. This project gives Administrators and Managers easy access to view orders placed using the Pay Later and Instant Pay options.
In order to develop an e-commerce website, a number of technologies must be studied and understood. These include multi-tiered architecture, server and client-side scripting techniques, implementation technologies, programming languages (such as PHP, HTML, CSS, JavaScript) and MySQL relational databases. The objective of this project is to develop a basic shopping cart website for consumers and to learn about the technologies used to develop such a website.
This document will discuss each of the underlying technologies used to create and implement an e-commerce website.
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdffxintegritypublishin
Advancements in technology unveil a myriad of electrical and electronic breakthroughs geared towards efficiently harnessing limited resources to meet human energy demands. The optimization of hybrid solar PV panels and pumped hydro energy supply systems plays a pivotal role in utilizing natural resources effectively. This initiative not only benefits humanity but also fosters environmental sustainability. The study investigated the design optimization of these hybrid systems, focusing on understanding solar radiation patterns, identifying geographical influences on solar radiation, formulating a mathematical model for system optimization, and determining the optimal configuration of PV panels and pumped hydro storage. Through a comparative analysis approach and eight weeks of data collection, the study addressed key research questions related to solar radiation patterns and optimal system design. The findings highlighted regions with heightened solar radiation levels, showcasing substantial potential for power generation and emphasizing the system's efficiency. Optimizing system design significantly boosted power generation, promoted renewable energy utilization, and enhanced energy storage capacity. The study underscored the benefits of optimizing hybrid solar PV panels and pumped hydro energy supply systems for sustainable energy usage. Optimizing the design of solar PV panels and pumped hydro energy supply systems as examined across diverse climatic conditions in a developing country, not only enhances power generation but also improves the integration of renewable energy sources and boosts energy storage capacities, particularly beneficial for less economically prosperous regions. Additionally, the study provides valuable insights for advancing energy research in economically viable areas. 
Recommendations included conducting site-specific assessments, utilizing advanced modeling tools, implementing regular maintenance protocols, and enhancing communication among system components.
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)MdTanvirMahtab2
This presentation is about the working procedure of Shahjalal Fertilizer Company Limited (SFCL). A Govt. owned Company of Bangladesh Chemical Industries Corporation under Ministry of Industries.
HEAP SORT ILLUSTRATED WITH HEAPIFY, BUILD HEAP FOR DYNAMIC ARRAYS.
Heap sort is a comparison-based sorting technique based on Binary Heap data structure. It is similar to the selection sort where we first find the minimum element and place the minimum element at the beginning. Repeat the same process for the remaining elements.
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...Amil Baba Dawood bangali
Contact with Dawood Bhai Just call on +92322-6382012 and we'll help you. We'll solve all your problems within 12 to 24 hours and with 101% guarantee and with astrology systematic. If you want to take any personal or professional advice then also you can call us on +92322-6382012 , ONLINE LOVE PROBLEM & Other all types of Daily Life Problem's.Then CALL or WHATSAPP us on +92322-6382012 and Get all these problems solutions here by Amil Baba DAWOOD BANGALI
#vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore#blackmagicformarriage #aamilbaba #kalajadu #kalailam #taweez #wazifaexpert #jadumantar #vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore #blackmagicforlove #blackmagicformarriage #aamilbaba #kalajadu #kalailam #taweez #wazifaexpert #jadumantar #vashikaranspecialist #astrologer #palmistry #amliyaat #taweez #manpasandshadi #horoscope #spiritual #lovelife #lovespell #marriagespell#aamilbabainpakistan #amilbabainkarachi #powerfullblackmagicspell #kalajadumantarspecialist #realamilbaba #AmilbabainPakistan #astrologerincanada #astrologerindubai #lovespellsmaster #kalajaduspecialist #lovespellsthatwork #aamilbabainlahore #Amilbabainuk #amilbabainspain #amilbabaindubai #Amilbabainnorway #amilbabainkrachi #amilbabainlahore #amilbabaingujranwalan #amilbabainislamabad
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Dr.Costas Sachpazis
Terzaghi's soil bearing capacity theory, developed by Karl Terzaghi, is a fundamental principle in geotechnical engineering used to determine the bearing capacity of shallow foundations. This theory provides a method to calculate the ultimate bearing capacity of soil, which is the maximum load per unit area that the soil can support without undergoing shear failure. The Calculation HTML Code included.
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
1. IJRET: International Journal of Research in Engineering and Technology eISSN: 2319-1163 | pISSN: 2321-7308
__________________________________________________________________________________________
Volume: 03 Issue: 01 | Jan-2014, Available @ http://www.ijret.org 201
RECOMMENDATION GENERATION BY INTEGRATING SEQUENTIAL
PATTERN MINING AND SEMANTICS
Geethapriya Uvaraja
Computer Science and Engineering Anna University, Tamilnadu, India
Abstract
As Internet usage keeps increasing, the number of web sites, and hence the number of web pages, also keeps increasing. A
recommendation system can provide a personalized web service by suggesting the pages that are likely to be accessed in
the future. Most recommendation systems are based either on association rule mining or on keywords. With association rule
mining the prediction rate is low, because it does not take into account the order in which users access the web pages.
Keyword-based recommendation systems provide less relevant results. This paper proposes a recommendation system that
exploits the advantages of sequential pattern mining over association rule mining, and of semantics over keyword-based
systems.
Keywords: Sequential Pattern Mining, Taxonomy, Apriori-All, CS-Mine, Semantic, Clustering
----------------------------------------------------------------------***------------------------------------------------------------------------
1. INTRODUCTION
A recommender system understands the users’ navigation
pattern by exploiting web usage mining and provides
personalization service based on the results of mining. The
recommender system will propose links of possible interest to
the user. Data mining and machine learning algorithms are
used in developing a recommendation system. In general,
recommendation systems take the users’ navigations in the
form of a log file. Some kind of mining is performed over the
data in the log file to find the usage patterns. When a new user
arrives, the current access is matched with the mined patterns
to generate recommendations.
The semantic information is used along with web usage data
to get more relevant patterns. Since web logs lack semantic
information about web pages visited by users, it is difficult to
understand the preferences and intents of users. With the
development of the Semantic Web, semantics in web content
can be used for improving the relevancy of recommendation.
The objective of this paper is to provide recommendations by
integrating sequential pattern mining and semantics. The
sequential patterns obtained from two sequential pattern
mining algorithms, Apriori-All and Conditional Sequence
Mine (CS-Mine), are compared. A taxonomy is constructed for
the website for which the recommendations are to be generated.
The keywords representing each web page of the website are
found and mapped to the taxonomy, using a similarity measure,
to obtain the categories. The documents are then clustered
based on the categories. The recommendation rules generated
by the sequential pattern mining algorithm and the clusters
are used to generate recommendations.
The rest of the paper is organized as follows. Section 2
describes the literature survey and the related work. Section 3
presents the system design. Section 4 describes the results of
the modules implemented and the performance evaluation.
Section 5 concludes the paper.
2. LITERATURE SURVEY AND RELATED
WORK
Web mining is the process of mining or extracting useful or
valuable information from the web. It is the application of
data mining techniques to the World Wide Web to extract
patterns from the resources available on the web. Web mining
has three categories:
1. Web content mining
2. Web usage mining
3. Web structure mining
Web content mining is the discovery of patterns from the
contents of web pages, with the aim of improving relevancy. It
involves information extraction, knowledge
discovery and analysis of a collection of documents. Web
usage mining is the discovery of patterns from user
navigation data in the form of a log file. Web structure mining
is the discovery of knowledge from the hyperlinks and link
structure between web pages.
A recommendation system incorporates web mining
techniques in order to provide personalized suggestions to
users. There are various approaches through which
recommendations can be generated. Most traditional
recommendation systems are based on clustering, on
user-given ratings and on user feedback. Due to the
presence of a huge number of meaningful clusters and profiles
for the visitors of a typical highly rated website, model-based
or distance-based methods tend to make overly strong and
simplistic assumptions or to become excessively complex and
slow. Collaborative filtering systems predict a person’s
affinity for items or information by connecting that person’s
recorded interests with the recorded interests of a community
of people and sharing ratings between like-minded persons.
This approach is based on the assumption that those who
agreed in the past tend to agree again in the future. It can
suggest items to a user that are outside the scope of the
previously selected items. However, it cannot produce accurate
recommendations when the user rating data are extremely
sparse, and it does not scale well as the number of users and
items increases. A content-based filtering system selects
items based on the correlation between the content of the
items and the user’s preferences, as opposed to a
collaborative filtering system, which chooses items based on
the correlation between people with similar preferences.
In personalized recommendation systems based on web usage
mining, association rule mining has been applied to predict
user browsing behavior. This method scales better with large
datasets than systems based on clustering. A weighted
association rule model was proposed that assigns a weight to
each page based on the time spent by each user on the page and
the visiting frequency of the page. The weight is used to judge
the importance of a page to a user, so that more consideration
is given to pages that are more useful to the user. This
method achieves better precision and coverage than
conventional association rule based recommendation [8]. Its
drawback is that the idle time of the user on a web page cannot
be distinguished from the time actually spent by the user on
the page. Another recommendation method applies web log
mining by integrating user clustering and association rule
mining techniques. Its precision was better than that of an
association rule based recommendation system, but there was
not much improvement in coverage ratio or speed [7].
Semantic Web Mining aims at combining two fast-developing
research areas, the Semantic Web and Web Mining. The
Semantic Web makes web content understandable not only by
humans but also by machines. Machine-processable
information can provide more relevant results and can
improve precision and recall. To achieve this, the content
of the Semantic Web is mostly represented using ontologies,
XML, RDF and metadata. Ontology learning is a method for
extracting semantics from the web in order to create an
ontology. Machine learning techniques have been used to
improve the ontology engineering process. Semantic Web Mining
improves the results of Web Mining by exploiting the new
semantic structures in the web [10].
3. SYSTEM DESIGN
The overall system architecture of the proposed
recommendation system is shown in Fig. 1.
Fig. 1 System Design
System Description
The dataset used is the web server log file (web usage data) of
the website http://cs.annauniv.edu and the web pages (web
content data) of the same site. Each module in the designed
recommendation system is explained as follows.
3.1 Pre-processing
The objective of pre-processing is to reduce the size of the
web usage data (web server log) and to improve the quality of
the data so that it is suitable for mining. Web usage data
contains a lot of noise. The log file contains records for
requests to images, multimedia files or script files. Some
records are due to requests made by web crawlers. The log
file also contains requests that were not processed. Such
records are not necessary for analysis and hence they are
eliminated. The unprocessed requests can be identified using
the HTTP status code. If the requested page has images, many
records are created for a single web page request. Such
repetitive records are pruned. A web server receives
requests from multiple users at the same time, so the records
for requests from different users are interleaved. The session
of each user has to be identified. A user is identified by the
combination of the IP address and the user agent. The list of
web pages requested by a user forms a web access list, which is
the input to the sequential pattern mining algorithms.
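The filtering and user identification steps described above can be sketched as follows. This is only an illustrative sketch: the paper does not state the log format, so the Combined Log Format pattern, the list of file extensions, the crawler markers and the rule "keep only 2xx status codes" are all assumptions, and `build_access_lists` is a hypothetical name.

```python
import re
from collections import defaultdict

# Assumption: the server writes the Combined Log Format.
LOG_LINE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+)[^"]*" (?P<status>\d{3}) \S+ '
    r'"[^"]*" "(?P<agent>[^"]*)"'
)
# Assumption: these suffixes mark images, multimedia and script files.
SKIP_EXTENSIONS = (".gif", ".jpg", ".jpeg", ".png", ".ico",
                   ".css", ".js", ".swf", ".mp3", ".mp4")
# Assumption: crawlers reveal themselves in the user agent string.
BOT_MARKERS = ("bot", "crawler", "spider", "slurp")


def build_access_lists(lines):
    """Return {(ip, agent): [url, ...]} after dropping noisy records."""
    access_lists = defaultdict(list)
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None:
            continue  # malformed record
        url = match["url"].split("?")[0]
        agent = match["agent"]
        if not match["status"].startswith("2"):
            continue  # request was not processed successfully
        if url.lower().endswith(SKIP_EXTENSIONS):
            continue  # image, multimedia or script request
        if any(marker in agent.lower() for marker in BOT_MARKERS):
            continue  # request made by a web crawler
        # A user is identified by the IP address plus the user agent.
        access_lists[(match["ip"], agent)].append(url)
    return dict(access_lists)
```

Records are appended in log order, so each web access list keeps the order in which the user requested the pages.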
3.2 Sequential Pattern Mining
Sequential pattern mining is the mining of frequently
occurring patterns ordered by time. Using sequential pattern
mining, one can identify the paths that users frequently follow
on a web site, which increases the prediction rate.
Sequential pattern mining is well suited to log analysis
because of the sequential nature of web users’ activity. In
this paper the frequent sequence patterns generated by two
sequential pattern mining algorithms, Apriori-All and CS-Mine,
are evaluated.
Apriori-All is similar to the well-known Apriori algorithm;
the difference is that Apriori-All considers the ordering
of the items in a transaction, whereas Apriori does not.
Hence Apriori-All can be regarded as the sequential version of
Apriori [3].
The Conditional Sequence Mine (CS-Mine) algorithm takes as
input a sequence database in which each sequence is a list of
transactions ordered by transaction time, and works on the
conditional sequence of each frequent pattern [1].
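To illustrate why ordering matters, the sketch below contrasts the support of an ordered sequence with the support of an unordered itemset over the same web access lists. It is not an implementation of Apriori-All or CS-Mine; it only shows the support notion that separates sequential pattern mining from association rule mining. The function names and the toy access lists are hypothetical.

```python
def contains_in_order(access_list, pattern):
    """True if the pages of `pattern` occur in `access_list` in order (gaps allowed)."""
    remaining = iter(access_list)
    # `page in remaining` advances the iterator, so later pages must come later.
    return all(page in remaining for page in pattern)


def sequence_support(access_lists, pattern):
    """Fraction of access lists containing `pattern` as an ordered subsequence."""
    hits = sum(contains_in_order(a, pattern) for a in access_lists)
    return hits / len(access_lists)


def itemset_support(access_lists, pages):
    """Fraction of access lists containing all `pages`, ignoring order."""
    hits = sum(set(pages) <= set(a) for a in access_lists)
    return hits / len(access_lists)


access_lists = [
    ["A", "B", "C"],
    ["B", "A", "C"],
    ["A", "C"],
    ["C", "A", "B"],
]
print(sequence_support(access_lists, ["A", "B"]))  # 0.5
print(itemset_support(access_lists, ["A", "B"]))   # 0.75
```

The second access list contains both pages but visits B before A, so it counts towards the itemset support and not towards the sequence support.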
3.3 Pattern Tree Construction
A pattern tree is used to store the sequential patterns
compactly. A single scan of the sequential access patterns
generated by the Apriori-All or CS-Mine
algorithm is sufficient to construct a pattern tree. The pattern
tree is based on the trie data structure, which is used for
storing strings to enable fast pattern matching. The root node
of the pattern tree is a dummy node. Every other node holds a
web access and its support. All the sequential access patterns
in the pattern tree can be visited by following the paths
starting from the root node of the tree.
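A minimal sketch of such a pattern tree is given below. The class and function names are hypothetical, and the sketch assumes that the mined pattern list contains every prefix of every frequent pattern together with its support, so that each node receives a support value.

```python
class PatternNode:
    """One node of the pattern tree: a web access and its support."""

    def __init__(self, page=None, support=0.0):
        self.page = page          # None for the dummy root
        self.support = support
        self.children = {}        # page -> PatternNode


def build_pattern_tree(patterns):
    """Build the tree in a single scan of (sequence, support) pairs."""
    root = PatternNode()
    for sequence, support in patterns:
        node = root
        for page in sequence:
            # Sequences with the same prefix share the same sub-path.
            node = node.children.setdefault(page, PatternNode(page))
        node.support = support    # support of the pattern ending here
    return root


patterns = [
    (("A",), 0.6),
    (("A", "B"), 0.4),
    (("A", "C"), 0.3),
    (("B",), 0.5),
]
root = build_pattern_tree(patterns)
print(sorted(root.children["A"].children))  # ['B', 'C']
```

Storing the children in a dictionary keyed by page makes each step of a path lookup a constant-time operation on average.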
3.4 Sequence Pattern Based Recommendation Engine
This component searches for the best matching access path in
the pattern tree for the given access sequence. The given
access sequence is matched with the nodes in the pattern tree
and the web accesses in the child nodes of the matching path
are generated as recommendations. The recommendations are
generated in descending order of the support of the web
accesses, i.e., the most frequent access is suggested first.
When no matching path can be found for the whole access
sequence, its suffix sequences are considered. The search is
repeated on the same access sequence with the first item
removed, until a matching path is found or no more items can
be removed from the access sequence. The length of the longest
path in the pattern tree is the depth of the pattern tree. A
matching path cannot exist when the current access sequence
is longer than the depth of the pattern tree. Therefore, some
initial items can be removed to make the current access
sequence shorter than the depth of the pattern tree before the
sequence matching process.
The recommendation rules generated by this engine are later
used by the semantic based recommendation engine to generate
recommendations based on the semantics of the web contents.
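The matching procedure of this section can be sketched as follows. For brevity the pattern tree is written as nested dictionaries; the name `recommend`, the example tree and its support values are hypothetical. One design choice is an assumption beyond the text: when a suffix matches a path that has no child nodes, the sketch keeps shortening the sequence instead of returning an empty list.

```python
# Hypothetical pattern tree: page -> {"support": ..., "children": {...}}
EXAMPLE_TREE = {
    "A": {"support": 0.6, "children": {
        "B": {"support": 0.4, "children": {
            "D": {"support": 0.2, "children": {}},
        }},
        "C": {"support": 0.3, "children": {}},
    }},
    "B": {"support": 0.5, "children": {
        "C": {"support": 0.25, "children": {}},
        "D": {"support": 0.35, "children": {}},
    }},
}


def recommend(tree, access_sequence, depth):
    """Return next pages for the longest matching suffix, most frequent first."""
    # Drop initial items so the sequence is shorter than the tree depth.
    keep = max(depth - 1, 0)
    sequence = list(access_sequence)[-keep:] if keep else []
    while sequence:
        children = tree
        for page in sequence:
            node = children.get(page)
            if node is None:
                children = None   # no matching path for this suffix
                break
            children = node["children"]
        if children:
            ranked = sorted(children.items(),
                            key=lambda item: item[1]["support"],
                            reverse=True)
            return [page for page, _ in ranked]
        sequence = sequence[1:]   # remove the first item and retry
    return []


print(recommend(EXAMPLE_TREE, ["X", "A"], depth=3))       # ['B', 'C']
print(recommend(EXAMPLE_TREE, ["C", "A", "B"], depth=3))  # ['D']
```

In the first call the page X is not in the tree, so the search falls back to the suffix containing only A. In the second call the sequence is as long as the tree depth, so its first item is dropped before matching.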
3.5 Keyword Extraction
The web pages are parsed, the tags are removed and the web
contents are extracted. A text document is created
for each page. The most common words (stop
words) are removed from each document. The significant
keywords of each document are identified using TF*IDF.
TF*IDF is a statistical measure used to evaluate how
important a word is to a document in a collection. The
importance increases in proportion to the number of times a
word appears in the document but is offset by the frequency of
the word in the corpus.
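A small sketch of the TF*IDF weighting is shown below. The paper does not say which variant is used, so the sketch assumes a common one: term frequency normalized by document length, multiplied by the logarithm of the inverse document frequency. The function names and the toy documents are hypothetical, and the documents are assumed to be already tokenized with stop words removed.

```python
import math
from collections import Counter


def tf_idf(documents):
    """documents: {name: [token, ...]} -> {name: {term: weight}}."""
    n_docs = len(documents)
    doc_freq = Counter()
    for tokens in documents.values():
        doc_freq.update(set(tokens))   # count each term once per document
    weights = {}
    for name, tokens in documents.items():
        counts = Counter(tokens)
        weights[name] = {
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        }
    return weights


def top_keywords(term_weights, k):
    """Return the k terms with the highest TF*IDF weight."""
    return sorted(term_weights, key=term_weights.get, reverse=True)[:k]


documents = {
    "ug.html": ["course", "syllabus", "course"],
    "staff.html": ["faculty", "research"],
    "home.html": ["course", "faculty"],
}
weights = tf_idf(documents)
print(top_keywords(weights["ug.html"], 1))  # ['syllabus']
```

Although "course" appears twice in the first document, it also appears in another document, so the rarer word "syllabus" receives the higher weight.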
3.6 Taxonomy Construction
A taxonomy is a concept hierarchy that provides a means for
designing enhanced searching, browsing and filtering systems.
A domain-specific taxonomy is constructed using XML. The
keywords extracted in the previous step have to be mapped
to the elements of the taxonomy to get the categories. Hence
the taxonomy has a great influence on the outcome of the
mapping process, and it has to be semantically related to the
contents of the website.
3.7 Mapping Keywords and the Categories
Keywords represent the contents of the web
pages. These keywords are mapped to the categories in the
taxonomy using a thesaurus (WordNet). If a keyword exists
in the taxonomy, it is included. If it does not exist, the
closest category in the taxonomy is found by making use of the
thesaurus. In this paper the closest category is
found using the Jiang and Conrath similarity measure. The
documents are now represented by categories in the taxonomy.
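For reference, the Jiang and Conrath measure is usually stated in terms of information content (IC). The paper does not give the formula or the corpus used to estimate the concept probabilities, so the standard textbook form is shown here as an assumption; lcs denotes the lowest common subsumer of two concepts in the thesaurus hierarchy, and P(c) is the probability of encountering concept c in a corpus.

```latex
\mathrm{IC}(c) = -\log P(c)

\mathrm{dist}_{JC}(c_1, c_2) = \mathrm{IC}(c_1) + \mathrm{IC}(c_2)
    - 2\,\mathrm{IC}\bigl(\mathrm{lcs}(c_1, c_2)\bigr)

\mathrm{sim}_{JC}(c_1, c_2) = \frac{1}{\mathrm{dist}_{JC}(c_1, c_2)}
```

Under this form, the closest category for a keyword is the taxonomy category with the smallest distance, or equivalently the largest similarity, to the keyword.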
3.8 Document Clustering
As the web pages are now represented by categories in the
taxonomy, they are clustered based on the similarity between
those categories. Clustering based on semantics
makes recommendation generation easier. DBSCAN, a
density based clustering algorithm, is used for document
clustering.
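The sketch below shows how DBSCAN could cluster pages that are represented as sets of categories. The paper does not specify the distance function or the parameter values, so the Jaccard distance between category sets, `eps=0.5` and `min_pts=2` are assumptions made only for illustration, as are the page names.

```python
def jaccard_distance(a, b):
    """1 - |a & b| / |a | b| for two sets of categories."""
    union = a | b
    return 1.0 - len(a & b) / len(union) if union else 0.0


def dbscan(items, eps, min_pts, distance=jaccard_distance):
    """items: {name: set_of_categories} -> {name: cluster_id}, -1 means noise."""
    names = list(items)
    labels = {}

    def neighbours(name):
        return [other for other in names
                if distance(items[name], items[other]) <= eps]

    cluster_id = -1
    for name in names:
        if name in labels:
            continue
        seeds = neighbours(name)
        if len(seeds) < min_pts:
            labels[name] = -1            # noise, may become a border point later
            continue
        cluster_id += 1                  # `name` is a core point: new cluster
        labels[name] = cluster_id
        queue = [s for s in seeds if s != name]
        while queue:
            current = queue.pop()
            if labels.get(current) == -1:
                labels[current] = cluster_id   # noise becomes a border point
            if current in labels:
                continue
            labels[current] = cluster_id
            expansion = neighbours(current)
            if len(expansion) >= min_pts:      # expand only from core points
                queue.extend(expansion)
    return labels


pages = {
    "/ug.html": {"course", "syllabus"},
    "/pg.html": {"course", "syllabus", "admission"},
    "/syllabus.html": {"syllabus", "course"},
    "/staff.html": {"faculty", "research"},
    "/labs.html": {"research", "faculty", "lab"},
    "/contact.html": {"address"},
}
labels = dbscan(pages, eps=0.5, min_pts=2)
```

With these toy inputs the three course pages fall into one cluster, the two research pages into another, and the contact page is labelled as noise.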
3.9 Semantic Based Recommendation Engine
The recommendation rules generated by the sequential pattern
based recommendation engine and the document clusters are
provided as input to the semantic based recommendation
engine. Every document is already assigned to a relevant
cluster. Given a recommendation rule, all the URLs in the
cluster to which the URL on the right-hand side of the rule
belongs are generated as recommendations [10].
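The cluster lookup described above can be sketched in a few lines. The function name, the cluster assignment and the URLs are hypothetical. Returning only the rule's own URL when it has no cluster is an assumption, since the paper does not say how such a case is handled.

```python
def semantic_recommendations(rule_rhs_url, cluster_of):
    """Return all URLs in the cluster of the rule's right-hand-side URL."""
    target = cluster_of.get(rule_rhs_url)
    if target is None or target == -1:
        # Assumption: an unclustered page is recommended on its own.
        return [rule_rhs_url]
    return sorted(url for url, cluster in cluster_of.items()
                  if cluster == target)


cluster_of = {
    "/ug.html": 0,
    "/pg.html": 0,
    "/syllabus.html": 0,
    "/staff.html": 1,
    "/labs.html": 1,
    "/contact.html": -1,
}
# Rule: /index.html -> /ug.html (right-hand side is /ug.html)
print(semantic_recommendations("/ug.html", cluster_of))
# ['/pg.html', '/syllabus.html', '/ug.html']
```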
4. RESULTS
The data set used is the log file of the website
http://cs.annauniv.edu. The log file is assumed to be static for
experimental purposes. The log file contains 140283 records.
After pre-processing, the log was reduced to 15% of its
initial size. Web access lists are
extracted from the pre-processed records. These web access
lists are given to the sequential pattern mining algorithms
Apriori-All and CS-Mine, and their performance is evaluated.
Fig. 2 Run time vs. Threshold Support
Fig. 2 shows that the run time of CS-Mine is lower than that
of Apriori-All. This is because Apriori-All generates a huge set
of candidate sequences and needs many scans of the sequence
database. Beyond a certain support threshold, say 6%, the run
times of the two algorithms become equal. The run time of
both algorithms decreases as the support threshold
increases.
Fig. 3 Number of patterns generated vs. threshold support
Fig. 3 shows that the number of patterns generated by
Apriori-All is greater than that generated by CS-Mine.
Although Apriori-All generated more patterns than CS-Mine,
the recommendation engine generated the same recommendations
from both algorithms at all support thresholds. Hence CS-Mine,
which takes less time and generates fewer patterns, is found to
be efficient in terms of recommendation generation as well.
Precision is an evaluation measure of how likely a user is to
access one of the recommended pages.
Precision = Number of correct recommendations / Total
number of recommendations
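As a worked example of this measure, the sketch below counts a recommendation as correct when the user actually accessed the recommended page afterwards. That interpretation of "correct", the function name and the page names are assumptions made for illustration.

```python
def precision(recommended, accessed):
    """Correct recommendations divided by total recommendations."""
    if not recommended:
        return 0.0
    correct = sum(1 for page in recommended if page in accessed)
    return correct / len(recommended)


recommended = ["/ug.html", "/pg.html", "/staff.html", "/labs.html"]
accessed = {"/ug.html", "/staff.html"}
print(precision(recommended, accessed))  # 0.5
```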
Fig. 4 Precision percentage vs. No. of pages recommended
Fig. 4 shows that the precision of the recommendation system
increases as the number of recommendations generated
increases. Since more recommendations are generated at a
lower support threshold, the precision is higher at low
support thresholds.
5. CONCLUSIONS
In this paper recommendations are provided by exploiting
two sequential pattern mining algorithms, Apriori-All and
CS-Mine, and it has been shown that CS-Mine is more
efficient in terms of time and generates fewer patterns.
The recommendations obtained using the two algorithms are the
same. Rules generated by the sequential pattern based engine
are then used to provide semantic based recommendations.
Sequential pattern based recommendations are generated by
matching the current access sequence of a user with paths in
the pattern tree. During pattern matching, only a few paths in
the pattern tree are matched. Hence time is saved, as
not all the paths in the tree are traversed. Space is also
saved, as all the sequences are compacted in the
pattern tree and sequences with the same prefix share
common sub-paths. The time-consuming part of this approach
is the pattern tree construction. In real time the pattern tree
need not be constructed for every request, and
hence construction does not greatly affect the time taken to
provide suggestions. One of the challenging problems in this
recommendation system is that when a page is newly visited and
is not in the pattern tree, no recommendations are provided
initially. Recommendations will be generated once the pattern
tree is updated with the new access sequences.
REFERENCES
[1]. Xiaogang Wang; Yan Bai; Yue Li: “An Information
Retrieval Method Based On Sequential Access Patterns”
Wearable Computing Systems (APWCS), 2010 Asia-Pacific
Conference, April 2010, pp: 247 - 250
[2]. Jaideep Srivastava, Robert Cooley, Mukund Deshpande,
Pang-Ning Tan: “Web Usage Mining: Discovery and
Applications of Usage Patterns from Web Data” ACM
SIGKDD Volume 1, Issue 2, January 2000, pp: 12 – 23
[3]. WANG Tong, HE Pi-lian: “Web Log Mining by an
Improved AprioriAll Algorithm” World Academy of Science,
Engineering and Technology, April 2005
[4]. Baoyao Zhou, Siu Cheung Hui, Kuiyu Chang: “An
Intelligent Recommender System using Sequential Web
access patterns” Cybernetics and Intelligent Systems, 2004
IEEE Conference, 2004, pp: 393-398
[5]. Yi Dong, Huiying Zhang, Linnan Jiao: “Research on
Application of User Navigation Pattern mining
recommendation” Intelligent Control and Automation, 2006.
The Sixth World Congress, 2006, pp: 6106 - 6110
[6]. Forsati, R.; Meybodi, M.R.; Rahbar, A.: “An Efficient
Algorithm for Web Recommendation Systems” Computer
Systems and Applications, 2009. IEEE/ACS International
Conference, 2009, pp: 579 - 586
[7]. Jaideep Srivastava, Robert Cooley, Mukund
Deshpande, Pang-Ning Tan: “Web Usage Mining: Discovery
and Applications of Usage Patterns from Web Data”
ACM SIGKDD Volume 1, Issue 2, January 2000, pp: 12 – 23
[8]. Bettina Berendt, Andreas Hotho, and Gerd Stumme:
“Towards Semantic Web Mining”, I. Horrocks and J. Hendler
(Eds.): ISWC 2002, LNCS 2342, 2002, pp. 264–278
[9]. Lappas, G: “An Overview of Web Mining in Societal
Benefit Areas” 4th IEEE International Conference on
Enterprise Computing, E-Commerce and E-Services, 2007,
pp: 683-690
[10]. M. Eirinaki, M. Vazirgiannis, I. Varlamis: “SEWeP:
Using Site Semantics and a Taxonomy to Enhance the Web
Personalization Process” SIGKDD ’03, August 24-27, 2003
BIOGRAPHY:
Geethapriya Uvaraja, Post-Graduate from Anna
University, Chennai