Web structure mining analyzes the hyperlink structure of websites to extract useful information. It involves discovering patterns in how webpages link to each other. This can help determine the importance or relevance of individual pages. The document discusses web structure mining techniques for analyzing link patterns and relationships between webpages in order to classify pages, identify clusters of related pages, and determine the strength or type of connections between pages. It focuses on using these techniques for online booking domains.
The International Journal of Engineering & Science is aimed at providing a platform for researchers, engineers, scientists, or educators to publish their original research results, to exchange new ideas, to disseminate information in innovative designs, engineering experiences and technological skills. It is also the Journal's objective to promote engineering and technology education. All papers submitted to the Journal will be blind peer-reviewed. Only original articles will be published.
The papers for publication in The International Journal of Engineering& Science are selected through rigorous peer reviews to ensure originality, timeliness, relevance, and readability.
Web Page Recommendation Using Web MiningIJERA Editor
On World Wide Web various kind of content are generated in huge amount, so to give relevant result to user web recommendation become important part of web application. On web different kind of web recommendation are made available to user every day that includes Image, Video, Audio, query suggestion and web page. In this paper we are aiming at providing framework for web page recommendation. 1) First we describe the basics of web mining, types of web mining. 2) Details of each web mining technique.3)We propose the architecture for the personalized web page recommendation.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
The International Journal of Engineering & Science is aimed at providing a platform for researchers, engineers, scientists, or educators to publish their original research results, to exchange new ideas, to disseminate information in innovative designs, engineering experiences and technological skills. It is also the Journal's objective to promote engineering and technology education. All papers submitted to the Journal will be blind peer-reviewed. Only original articles will be published.
The papers for publication in The International Journal of Engineering& Science are selected through rigorous peer reviews to ensure originality, timeliness, relevance, and readability.
Web Page Recommendation Using Web MiningIJERA Editor
On World Wide Web various kind of content are generated in huge amount, so to give relevant result to user web recommendation become important part of web application. On web different kind of web recommendation are made available to user every day that includes Image, Video, Audio, query suggestion and web page. In this paper we are aiming at providing framework for web page recommendation. 1) First we describe the basics of web mining, types of web mining. 2) Details of each web mining technique.3)We propose the architecture for the personalized web page recommendation.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Web personalization using clustering of web usage dataijfcstjournal
The exponential growth in the number and the complexity of information resources and services on the Web
has made log data an indispensable resource to characterize the users for Web-based environment. It
creates information of related web data in the form of hierarchy structure through approximation. This
hierarchy structure can be used as the input for a variety of data mining tasks such as clustering,
association rule mining, sequence mining etc.
In this paper, we present an approach for personalizing web user environment dynamically when he
interacting with web by clustering of web usage data using concept hierarchy. The system is inferred from
the web server’s access logs by means of data and web usage mining techniques to extract the information
about users. The extracted knowledge is used for the purpose of offering a personalized view of the
services to users.
Web is a collection of inter-related files on one or more web servers while web mining means extracting
valuable information from web databases. Web mining is one of the data mining domains where data
mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer
behaviour, evaluate a particular website based on the information which is stored in web log files. Web
mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library.
Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase
the complexity of dealing information from different web service providers. The collection of information
becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
Web is a collection of inter-related files on one or more web servers while web mining means extracting
valuable information from web databases. Web mining is one of the data mining domains where data
mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer
behaviour, evaluate a particular website based on the information which is stored in web log files. Web
mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library.
Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase
the complexity of dealing information from different web service providers. The collection of information
becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
Web is a collection of inter-related files on one or more web servers while web mining means extracting
valuable information from web databases. Web mining is one of the data mining domains where data
mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer
behaviour, evaluate a particular website based on the information which is stored in web log files. Web
mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library.
Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase
the complexity of dealing information from different web service providers. The collection of information
becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
Web is a collection of inter-related files on one or more web servers while web mining means extracting valuable information from web databases. Web mining is one of the data mining domains where data mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer behaviour, evaluate a particular website based on the information which is stored in web log files. Web mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library. Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase the complexity of dealing information from different web service providers. The collection of information becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
Web is a collection of inter-related files on one or more web servers while web mining means extracting
valuable information from web databases. Web mining is one of the data mining domains where data
mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer
behaviour, evaluate a particular website based on the information which is stored in web log files. Web
mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library.
Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase
the complexity of dealing information from different web service providers. The collection of information
becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
Web is a collection of inter-related files on one or more web servers while web mining means extracting valuable information from web databases. Web mining is one of the data mining domains where data mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer behaviour, evaluate a particular website based on the information which is stored in web log files. Web mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library. Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase the complexity of dealing information from different web service providers. The collection of information becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
Web is a collection of inter-related files on one or more web servers while web mining means extracting valuable information from web databases. Web mining is one of the data mining domains where data mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer behaviour, evaluate a particular website based on the information which is stored in web log files. Web mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library. Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase the complexity of dealing information from different web service providers. The collection of information becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
In this world of information technology, everyone has the tendency to do business electronically. Today
lot of businesses are happening on World Wide Web (WWW), it is very important for the website owner to
provide a better platform to attract more customers for their site. Providing information in a better way is
the solution to bring more customers or users. Customer is the end-user, who accessing the information
in a way it yields some credit to the web site owners. In this paper we define web mining and present a
method to utilize web mining in a better way to know the users and website behaviour which in turn
enhance the web site information to attract more users. This paper also presents an overview of the
various researches done on pattern extraction, web content mining and how it can be taken as a catalyst
for E-business.
Abstract: In many fields, such as industry, commerce, government, and education, knowledge discovery and data
mining can be immensely valuable to the subject of Artificial Intelligence. Because of the recent increase in
demand for KDD techniques, such as those used in machine learning, databases, statistics, knowledge acquisition,
data visualisation, and high performance computing, knowledge discovery and data mining have grown in
importance. By employing standard formulas for computational correlations, we hope to create an integrated
technique that can be used to filter web world social information and find parallels between similar tastes of
diverse user information in a variety of settings
International Journal of Engineering Research and Development (IJERD)IJERD Editor
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJERD, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, reserach and review articles, IJERD Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathemetics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer reviw journal, indexed journal, reserach and review articles, engineering journal, www.ijerd.com, research journals,
yahoo journals, bing journals, International Journal of Engineering Research and Development, google journals, hard copy of journal
Data mining refers to the process of analysing the data from different perspectives and summarizing it into useful information.
Data mining software is one of the number of tools used for analysing data. It allows users to analyse from many different dimensions and angles, categorize it, and summarize the relationship identified.
Data mining is about technique for finding and describing Structural Patterns in data.
Data mining is the process of finding correlation or patterns among fields in large relational databases.
The process of extracting valid, previously unknown, comprehensible , and actionable information from large databases and using it to make crucial business decisions.
STRATEGY AND IMPLEMENTATION OF WEB MINING TOOLSAM Publications
In the current development, millions of clients are accessing daily the internet and World Wide Web (WWW) to search the information and achieve their necessities. Web mining is a technique to automatic discovers and Extract information from www. Websites are a common stage to discussion the information between users. Web mining is one of the applications of Data mining techniques for extracting information from web data. The area of web mining is web content mining, web usage mining and web structure mining. These three category focus on Knowledge discovery from web. Web content mining involves technique for summarization, classification, clustering and the process of extracting or discovering useful information web pages, it includes image, audio, video and metadata. Web usage mining is the process of extracting information from web server logs. Web structure mining it is the process of using graph theory to analyse the node and connection structure of a website and deals with the hyperlink structure of web. Web mining is a part of data mining which relates to various research communities such as information retrieval, database management systems and Artificial intelligence.
International Journal of Engineering Research and DevelopmentIJERD Editor
Electrical, Electronics and Computer Engineering,
Information Engineering and Technology,
Mechanical, Industrial and Manufacturing Engineering,
Automation and Mechatronics Engineering,
Material and Chemical Engineering,
Civil and Architecture Engineering,
Biotechnology and Bio Engineering,
Environmental Engineering,
Petroleum and Mining Engineering,
Marine and Agriculture engineering,
Aerospace Engineering.
Submission Deadline: 30th September 2022
Acceptance Notification: Within Three Days’ time period
Online Publication: Within 24 Hrs. time Period
Expected Date of Dispatch of Printed Journal: 5th October 2022
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...IAEME Publication
White layer thickness (WLT) formed and surface roughness in wire electric discharge turning (WEDT) of tungsten carbide composite has been made to model through response surface methodology (RSM). A Taguchi’s standard Design of experiments involving five input variables with three levels has been employed to establish a mathematical model between input parameters and responses. Percentage of cobalt content, spindle speed, Pulse on-time, wire feed and pulse off-time were changed during the experimental tests based on the Taguchi’s orthogonal array L27 (3^13). Analysis of variance (ANOVA) revealed that the mathematical models obtained can adequately describe performance within the parameters of the factors considered. There was a good agreement between the experimental and predicted values in this study.
More Related Content
Similar to ANALYTICAL IMPLEMENTATION OF WEB STRUCTURE MINING USING DATA ANALYSIS IN ONLINE BOOKING DOMAIN
Web personalization using clustering of web usage dataijfcstjournal
The exponential growth in the number and the complexity of information resources and services on the Web
has made log data an indispensable resource to characterize the users for Web-based environment. It
creates information of related web data in the form of hierarchy structure through approximation. This
hierarchy structure can be used as the input for a variety of data mining tasks such as clustering,
association rule mining, sequence mining etc.
In this paper, we present an approach for personalizing web user environment dynamically when he
interacting with web by clustering of web usage data using concept hierarchy. The system is inferred from
the web server’s access logs by means of data and web usage mining techniques to extract the information
about users. The extracted knowledge is used for the purpose of offering a personalized view of the
services to users.
Web is a collection of inter-related files on one or more web servers while web mining means extracting
valuable information from web databases. Web mining is one of the data mining domains where data
mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer
behaviour, evaluate a particular website based on the information which is stored in web log files. Web
mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library.
Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase
the complexity of dealing information from different web service providers. The collection of information
becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
Web is a collection of inter-related files on one or more web servers while web mining means extracting
valuable information from web databases. Web mining is one of the data mining domains where data
mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer
behaviour, evaluate a particular website based on the information which is stored in web log files. Web
mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library.
Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase
the complexity of dealing information from different web service providers. The collection of information
becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
Web is a collection of inter-related files on one or more web servers while web mining means extracting
valuable information from web databases. Web mining is one of the data mining domains where data
mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer
behaviour, evaluate a particular website based on the information which is stored in web log files. Web
mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library.
Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase
the complexity of dealing information from different web service providers. The collection of information
becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
Web is a collection of inter-related files on one or more web servers while web mining means extracting valuable information from web databases. Web mining is one of the data mining domains where data mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer behaviour, evaluate a particular website based on the information which is stored in web log files. Web mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library. Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase the complexity of dealing information from different web service providers. The collection of information becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
Web is a collection of inter-related files on one or more web servers while web mining means extracting
valuable information from web databases. Web mining is one of the data mining domains where data
mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer
behaviour, evaluate a particular website based on the information which is stored in web log files. Web
mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library.
Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase
the complexity of dealing information from different web service providers. The collection of information
becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
Web is a collection of inter-related files on one or more web servers while web mining means extracting valuable information from web databases. Web mining is one of the data mining domains where data mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer behaviour, evaluate a particular website based on the information which is stored in web log files. Web mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library. Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase the complexity of dealing information from different web service providers. The collection of information becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
Web is a collection of inter-related files on one or more web servers while web mining means extracting valuable information from web databases. Web mining is one of the data mining domains where data mining techniques are used for extracting information from the web servers. The web data includes web
pages, web links, objects on the web and web logs. Web mining is used to understand the customer behaviour, evaluate a particular website based on the information which is stored in web log files. Web mining is evaluated by using data mining techniques, namely classification, clustering, and association
rules. It has some beneficial areas or applications such as Electronic commerce, E-learning, Egovernment, E-policies, E-democracy, Electronic business, security, crime investigation and digital library. Retrieving the required web page from the web efficiently and effectively becomes a challenging task
because web is made up of unstructured data, which delivers the large amount of information and increase the complexity of dealing information from different web service providers. The collection of information becomes very hard to find, extract, filter or evaluate the relevant information for the users. In this paper,
we have studied the basic concepts of web mining, classification, processes and issues. In addition to this,
this paper also analyzed the web mining research challenges.
In this world of information technology, everyone has the tendency to do business electronically. Today
lot of businesses are happening on World Wide Web (WWW), it is very important for the website owner to
provide a better platform to attract more customers for their site. Providing information in a better way is
the solution to bring more customers or users. Customer is the end-user, who accessing the information
in a way it yields some credit to the web site owners. In this paper we define web mining and present a
method to utilize web mining in a better way to know the users and website behaviour which in turn
enhance the web site information to attract more users. This paper also presents an overview of the
various researches done on pattern extraction, web content mining and how it can be taken as a catalyst
for E-business.
Abstract: In many fields, such as industry, commerce, government, and education, knowledge discovery and data
mining can be immensely valuable to the subject of Artificial Intelligence. Because of the recent increase in
demand for KDD techniques, such as those used in machine learning, databases, statistics, knowledge acquisition,
data visualisation, and high performance computing, knowledge discovery and data mining have grown in
importance. By employing standard formulas for computational correlations, we hope to create an integrated
technique that can be used to filter web world social information and find parallels between similar tastes of
diverse user information in a variety of settings
International Journal of Engineering Research and Development (IJERD)IJERD Editor
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJERD, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, reserach and review articles, IJERD Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathemetics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer reviw journal, indexed journal, reserach and review articles, engineering journal, www.ijerd.com, research journals,
yahoo journals, bing journals, International Journal of Engineering Research and Development, google journals, hard copy of journal
Data mining refers to the process of analysing the data from different perspectives and summarizing it into useful information.
Data mining software is one of the number of tools used for analysing data. It allows users to analyse from many different dimensions and angles, categorize it, and summarize the relationship identified.
Data mining is about technique for finding and describing Structural Patterns in data.
Data mining is the process of finding correlation or patterns among fields in large relational databases.
The process of extracting valid, previously unknown, comprehensible , and actionable information from large databases and using it to make crucial business decisions.
STRATEGY AND IMPLEMENTATION OF WEB MINING TOOLSAM Publications
In the current development, millions of clients are accessing daily the internet and World Wide Web (WWW) to search the information and achieve their necessities. Web mining is a technique to automatic discovers and Extract information from www. Websites are a common stage to discussion the information between users. Web mining is one of the applications of Data mining techniques for extracting information from web data. The area of web mining is web content mining, web usage mining and web structure mining. These three category focus on Knowledge discovery from web. Web content mining involves technique for summarization, classification, clustering and the process of extracting or discovering useful information web pages, it includes image, audio, video and metadata. Web usage mining is the process of extracting information from web server logs. Web structure mining it is the process of using graph theory to analyse the node and connection structure of a website and deals with the hyperlink structure of web. Web mining is a part of data mining which relates to various research communities such as information retrieval, database management systems and Artificial intelligence.
International Journal of Engineering Research and DevelopmentIJERD Editor
Electrical, Electronics and Computer Engineering,
Information Engineering and Technology,
Mechanical, Industrial and Manufacturing Engineering,
Automation and Mechatronics Engineering,
Material and Chemical Engineering,
Civil and Architecture Engineering,
Biotechnology and Bio Engineering,
Environmental Engineering,
Petroleum and Mining Engineering,
Marine and Agriculture engineering,
Aerospace Engineering.
Similar to ANALYTICAL IMPLEMENTATION OF WEB STRUCTURE MINING USING DATA ANALYSIS IN ONLINE BOOKING DOMAIN (20)
Submission Deadline: 30th September 2022
Acceptance Notification: Within Three Days’ time period
Online Publication: Within 24 Hrs. time Period
Expected Date of Dispatch of Printed Journal: 5th October 2022
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...IAEME Publication
White layer thickness (WLT) formed and surface roughness in wire electric discharge turning (WEDT) of tungsten carbide composite has been made to model through response surface methodology (RSM). A Taguchi’s standard Design of experiments involving five input variables with three levels has been employed to establish a mathematical model between input parameters and responses. Percentage of cobalt content, spindle speed, Pulse on-time, wire feed and pulse off-time were changed during the experimental tests based on the Taguchi’s orthogonal array L27 (3^13). Analysis of variance (ANOVA) revealed that the mathematical models obtained can adequately describe performance within the parameters of the factors considered. There was a good agreement between the experimental and predicted values in this study.
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSIAEME Publication
The study explores the reasons for a transgender to become entrepreneurs. In this study transgender entrepreneur was taken as independent variable and reasons to become as dependent variable. Data were collected through a structured questionnaire containing a five point Likert Scale. The study examined the data of 30 transgender entrepreneurs in Salem Municipal Corporation of Tamil Nadu State, India. Simple Random sampling technique was used. Garrett Ranking Technique (Percentile Position, Mean Scores) was used as the analysis for the present study to identify the top 13 stimulus factors for establishment of trans entrepreneurial venture. Economic advancement of a nation is governed upon the upshot of a resolute entrepreneurial doings. The conception of entrepreneurship has stretched and materialized to the socially deflated uncharted sections of transgender community. Presently transgenders have smashed their stereotypes and are making recent headlines of achievements in various fields of our Indian society. The trans-community is gradually being observed in a new light and has been trying to achieve prospective growth in entrepreneurship. The findings of the research revealed that the optimistic changes are taking place to change affirmative societal outlook of the transgender for entrepreneurial ventureship. It also laid emphasis on other transgenders to renovate their traditional living. The paper also highlights that legislators, supervisory body should endorse an impartial canons and reforms in Tamil Nadu Transgender Welfare Board Association.
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSIAEME Publication
Since ages gender difference is always a debatable theme whether caused by nature, evolution or environment. The birth of a transgender is dreadful not only for the child but also for their parents. The pain of living in the wrong physique and treated as second class victimized citizen is outrageous and fully harboured with vicious baseless negative scruples. For so long, social exclusion had perpetuated inequality and deprivation experiencing ingrained malign stigma and besieged victims of crime or violence across their life spans. They are pushed into the murky way of life with a source of eternal disgust, bereft sexual potency and perennial fear. Although they are highly visible but very little is known about them. The common public needs to comprehend the ravaged arrogance on these insensitive souls and assist in integrating them into the mainstream by offering equal opportunity, treat with humanity and respect their dignity. Entrepreneurship in the current age is endorsing the gender fairness movement. Unstable careers and economic inadequacy had inclined one of the gender variant people called Transgender to become entrepreneurs. These tiny budding entrepreneurs resulted in economic transition by means of employment, free from the clutches of stereotype jobs, raised standard of living and handful of financial empowerment. Besides all these inhibitions, they were able to witness a platform for skill set development that ignited them to enter into entrepreneurial domain. This paper epitomizes skill sets involved in trans-entrepreneurs of Thoothukudi Municipal Corporation of Tamil Nadu State and is a groundbreaking determination to sightsee various skills incorporated and the impact on entrepreneurship.
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONSIAEME Publication
The banking and financial services industries are experiencing increased technology penetration. Among them, the banking industry has made technological advancements to better serve the general populace. The economy focused on transforming the banking sector's system into a cashless, paperless, and faceless one. The researcher wants to evaluate the user's intention for utilising a mobile banking application. The study also examines the variables affecting the user's behaviour intention when selecting specific applications for financial transactions. The researcher employed a well-structured questionnaire and a descriptive study methodology to gather the respondents' primary data utilising the snowball sampling technique. The study includes variables like performance expectations, effort expectations, social impact, enabling circumstances, and perceived risk. Each of the aforementioned variables has a major impact on how users utilise mobile banking applications. The outcome will assist the service provider in comprehending the user's history with mobile banking applications.
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONSIAEME Publication
Technology upgradation in banking sector took the economy to view that payment mode towards online transactions using mobile applications. This system enabled connectivity between banks, Merchant and user in a convenient mode. there are various applications used for online transactions such as Google pay, Paytm, freecharge, mobikiwi, oxygen, phonepe and so on and it also includes mobile banking applications. The study aimed at evaluating the predilection of the user in adopting digital transaction. The study is descriptive in nature. The researcher used random sample techniques to collect the data. The findings reveal that mobile applications differ with the quality of service rendered by Gpay and Phonepe. The researcher suggest the Phonepe application should focus on implementing the application should be user friendly interface and Gpay on motivating the users to feel the importance of request for money and modes of payments in the application.
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINOIAEME Publication
The prototype of a voice-based ATM for visually impaired using Arduino is to help people who are blind. This uses RFID cards which contain users fingerprint encrypted on it and interacts with the users through voice commands. ATM operates when sensor detects the presence of one person in the cabin. After scanning the RFID card, it will ask to select the mode like –normal or blind. User can select the respective mode through voice input, if blind mode is selected the balance check or cash withdraw can be done through voice input. Normal mode procedure is same as the existing ATM.
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...IAEME Publication
There is increasing acceptability of emotional intelligence as a major factor in personality assessment and effective human resource management. Emotional intelligence as the ability to build capacity, empathize, co-operate, motivate and develop others cannot be divorced from both effective performance and human resource management systems. The human person is crucial in defining organizational leadership and fortunes in terms of challenges and opportunities and walking across both multinational and bilateral relationships. The growing complexity of the business world requires a great deal of self-confidence, integrity, communication, conflict and diversity management to keep the global enterprise within the paths of productivity and sustainability. Using the exploratory research design and 255 participants the result of this original study indicates strong positive correlation between emotional intelligence and effective human resource management. The paper offers suggestions on further studies between emotional intelligence and human capital development and recommends for conflict management as an integral part of effective human resource management.
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMYIAEME Publication
Our life journey, in general, is closely defined by the way we understand the meaning of why we coexist and deal with its challenges. As we develop the "inspiration economy", we could say that nearly all of the challenges we have faced are opportunities that help us to discover the rest of our journey. In this note paper, we explore how being faced with the opportunity of being a close carer for an aging parent with dementia brought intangible discoveries that changed our insight of the meaning of the rest of our life journey.
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...IAEME Publication
The main objective of this study is to analyze the impact of aspects of Organizational Culture on the Effectiveness of the Performance Management System (PMS) in the Health Care Organization at Thanjavur. Organizational Culture and PMS play a crucial role in present-day organizations in achieving their objectives. PMS needs employees’ cooperation to achieve its intended objectives. Employees' cooperation depends upon the organization’s culture. The present study uses exploratory research to examine the relationship between the Organization's culture and the Effectiveness of the Performance Management System. The study uses a Structured Questionnaire to collect the primary data. For this study, Thirty-six non-clinical employees were selected from twelve randomly selected Health Care organizations at Thanjavur. Thirty-two fully completed questionnaires were received.
Living in 21st century in itself reminds all of us the necessity of police and its administration. As more and more we are entering into the modern society and culture, the more we require the services of the so called ‘Khaki Worthy’ men i.e., the police personnel. Whether we talk of Indian police or the other nation’s police, they all have the same recognition as they have in India. But as already mentioned, their services and requirements are different after the like 26th November, 2008 incidents, where they without saving their own lives has sacrificed themselves without any hitch and without caring about their respective family members and wards. In other words, they are like our heroes and mentors who can guide us from the darkness of fear, militancy, corruption and other dark sides of life and so on. Now the question arises, if Gandhi would have been alive today, what would have been his reaction/opinion to the police and its functioning? Would he have some thing different in his mind now what he had been in his mind before the partition or would he be going to start some Satyagraha in the form of some improvement in the functioning of the police administration? Really these questions or rather night mares can come to any one’s mind, when there is too much confusion is prevailing in our minds, when there is too much corruption in the society and when the polices working is also in the questioning because of one or the other case throughout the India. It is matter of great concern that we have to thing over our administration and our practical approach because the police personals are also like us, they are part and parcel of our society and among one of us, so why we all are pin pointing towards them.
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...IAEME Publication
The goal of this study was to see how talent management affected employee retention in the selected IT organizations in Chennai. The fundamental issue was the difficulty to attract, hire, and retain talented personnel who perform well and the gap between supply and demand of talent acquisition and retaining them within the firms. The study's main goals were to determine the impact of talent management on employee retention in IT companies in Chennai, investigate talent management strategies that IT companies could use to improve talent acquisition, performance management, career planning and formulate retention strategies that the IT firms could use. The respondents were given a structured close-ended questionnaire with the 5 Point Likert Scale as part of the study's quantitative research design. The target population consisted of 289 IT professionals. The questionnaires were distributed and collected by the researcher directly. The Statistical Package for Social Sciences (SPSS) was used to collect and analyse the questionnaire responses. Hypotheses that were formulated for the various areas of the study were tested using a variety of statistical tests. The key findings of the study suggested that talent management had an impact on employee retention. The studies also found that there is a clear link between the implementation of talent management and retention measures. Management should provide enough training and development for employees, clarify job responsibilities, provide adequate remuneration packages, and recognise employees for exceptional performance.
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...IAEME Publication
Globally, Millions of dollars were spent by the organizations for employing skilled Information Technology (IT) professionals. It is costly to replace unskilled employees with IT professionals possessing technical skills and competencies that aid in interconnecting the business processes. The organization’s employment tactics were forced to alter by globalization along with technological innovations as they consistently diminish to remain lean, outsource to concentrate on core competencies along with restructuring/reallocate personnel to gather efficiency. As other jobs, organizations or professions have become reasonably more appropriate in a shifting employment landscape, the above alterations trigger both involuntary as well as voluntary turnover. The employee view on jobs is also afflicted by the COVID-19 pandemic along with the employee-driven labour market. So, having effective strategies is necessary to tackle the withdrawal rate of employees. By associating Emotional Intelligence (EI) along with Talent Management (TM) in the IT industry, the rise in attrition rate was analyzed in this study. Only 303 respondents were collected out of 350 participants to whom questionnaires were distributed. From the employees of IT organizations located in Bangalore (India), the data were congregated. A simple random sampling methodology was employed to congregate data as of the respondents. Generating the hypothesis along with testing is eventuated. The effect of EI and TM along with regression analysis between TM and EI was analyzed. The outcomes indicated that employee and Organizational Performance (OP) were elevated by effective EI along with TM.
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...IAEME Publication
By implementing talent management strategy, organizations would have the option to retain their skilled professionals while additionally working on their overall performance. It is the course of appropriately utilizing the ideal individuals, setting them up for future top positions, exploring and dealing with their performance, and holding them back from leaving the organization. It is employee performance that determines the success of every organization. The firm quickly obtains an upper hand over its rivals in the event that its employees having particular skills that cannot be duplicated by the competitors. Thus, firms are centred on creating successful talent management practices and processes to deal with the unique human resources. Firms are additionally endeavouring to keep their top/key staff since on the off chance that they leave; the whole store of information leaves the firm's hands. The study's objective was to determine the impact of talent management on organizational performance among the selected IT organizations in Chennai. The study recommends that talent management limitedly affects performance. On the off chance that this talent is appropriately management and implemented properly, organizations might benefit as much as possible from their maintained assets to support development and productivity, both monetarily and non-monetarily.
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...IAEME Publication
Banking regulations act of India, 1949 defines banking as “acceptance of deposits for the purpose of lending or investment from the public, repayment on demand or otherwise and withdrawable through cheques, drafts order or otherwise”, the major participants of the Indian financial system are commercial banks, the financial institution encompassing term lending institutions. Investments institutions, specialized financial institution and the state level development banks, non banking financial companies (NBFC) and other market intermediaries such has the stock brokers and money lenders are among the oldest of the certain variants of NBFC and the oldest market participants. The asset quality of banks is one of the most important indicators of their financial health. The Indian banking sector has been facing severe problems of increasing Non- Performing Assets (NPAs). The NPAs growth directly and indirectly affects the quality of assets and profitability of banks. It also shows the efficiency of banks credit risk management and the recovery effectiveness. NPA do not generate any income, whereas, the bank is required to make provisions for such as assets that why is a double edge weapon. This paper outlines the concept of quality of bank loans of different types like Housing, Agriculture and MSME loans in state Haryana of selected public and private sector banks. This study is highlighting problems associated with the role of commercial bank in financing Small and Medium Scale Enterprises (SME). The overall objective of the research was to assess the effect of the financing provisions existing for the setting up and operations of MSMEs in the country and to generate recommendations for more robust financing mechanisms for successful operation of the MSMEs, in turn understanding the impact of MSME loans on financial institutions due to NPA. There are many research conducted on the topic of Non- Performing Assets (NPA) Management, concerning particular bank, comparative study of public and private banks etc. In this paper the researcher is considering the aggregate data of selected public sector and private sector banks and attempts to compare the NPA of Housing, Agriculture and MSME loans in state Haryana of public and private sector banks. The tools used in the study are average and Anova test and variance. The findings reveal that NPA is common problem for both public and private sector banks and is associated with all types of loans either that is housing loans, agriculture loans and loans to SMES. NPAs of both public and private sector banks show the increasing trend. In 2010-11 GNPA of public and private sector were at same level it was 2% but after 2010-11 it increased in many fold and at present there is GNPA in some more than 15%. It shows the dark area of Indian banking sector.
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...IAEME Publication
An experiment conducted in this study found that BaSO4 changed Nylon 6's mechanical properties. By changing the weight ratios, BaSO4 was used to make Nylon 6. This Researcher looked into how hard Nylon-6/BaSO4 composites are and how well they wear. Experiments were done based on Taguchi design L9. Nylon-6/BaSO4 composites can be tested for their hardness number using a Rockwell hardness testing apparatus. On Nylon/BaSO4, the wear behavior was measured by a wear monitor, pinon-disc friction by varying reinforcement, sliding speed, and sliding distance, and the microstructure of the crack surfaces was observed by SEM. This study provides significant contributions to ultimate strength by increasing BaSO4 content up to 16% in the composites, and sliding speed contributes 72.45% to the wear rate
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...IAEME Publication
The majority of the population in India lives in villages. The village is the back bone of the country. Village or rural industries play an important role in the national economy, particularly in the rural development. Developing the rural economy is one of the key indicators towards a country’s success. Whether it be the need to look after the welfare of the farmers or invest in rural infrastructure, Governments have to ensure that rural development isn’t compromised. The economic development of our country largely depends on the progress of rural areas and the standard of living of rural masses. Village or rural industries play an important role in the national economy, particularly in the rural development. Rural entrepreneurship is based on stimulating local entrepreneurial talent and the subsequent growth of indigenous enterprises. It recognizes opportunity in the rural areas and accelerates a unique blend of resources either inside or outside of agriculture. Rural entrepreneurship brings an economic value to the rural sector by creating new methods of production, new markets, new products and generate employment opportunities thereby ensuring continuous rural development. Social Entrepreneurship has the direct and primary objective of serving the society along with the earning profits. So, social entrepreneurship is different from the economic entrepreneurship as its basic objective is not to earn profits but for providing innovative solutions to meet the society needs which are not taken care by majority of the entrepreneurs as they are in the business for profit making as a sole objective. So, the Social Entrepreneurs have the huge growth potential particularly in the developing countries like India where we have huge societal disparities in terms of the financial positions of the population. Still 22 percent of the Indian population is below the poverty line and also there is disparity among the rural & urban population in terms of families living under BPL. 25.7 percent of the rural population & 13.7 percent of the urban population is under BPL which clearly shows the disparity of the poor people in the rural and urban areas. The need to develop social entrepreneurship in agriculture is dictated by a large number of social problems. Such problems include low living standards, unemployment, and social tension. The reasons that led to the emergence of the practice of social entrepreneurship are the above factors. The research problem lays upon disclosing the importance of role of social entrepreneurship in rural development of India. The paper the tendencies of social entrepreneurship in India, to present successful examples of such business for providing recommendations how to improve situation in rural areas in terms of social entrepreneurship development. Indian government has made some steps towards development of social enterprises, social entrepreneurship, and social in- novation, but a lot remains to be improved.
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...IAEME Publication
Distribution system is a critical link between the electric power distributor and the consumers. Most of the distribution networks commonly used by the electric utility is the radial distribution network. However in this type of network, it has technical issues such as enormous power losses which affect the quality of the supply. Nowadays, the introduction of Distributed Generation (DG) units in the system help improve and support the voltage profile of the network as well as the performance of the system components through power loss mitigation. In this study network reconfiguration was done using two meta-heuristic algorithms Particle Swarm Optimization and Gravitational Search Algorithm (PSO-GSA) to enhance power quality and voltage profile in the system when simultaneously applied with the DG units. Backward/Forward Sweep Method was used in the load flow analysis and simulated using the MATLAB program. Five cases were considered in the Reconfiguration based on the contribution of DG units. The proposed method was tested using IEEE 33 bus system. Based on the results, there was a voltage profile improvement in the system from 0.9038 p.u. to 0.9594 p.u.. The integration of DG in the network also reduced power losses from 210.98 kW to 69.3963 kW. Simulated results are drawn to show the performance of each case.
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...IAEME Publication
Manufacturing industries have witnessed an outburst in productivity. For productivity improvement manufacturing industries are taking various initiatives by using lean tools and techniques. However, in different manufacturing industries, frugal approach is applied in product design and services as a tool for improvement. Frugal approach contributed to prove less is more and seems indirectly contributing to improve productivity. Hence, there is need to understand status of frugal approach application in manufacturing industries. All manufacturing industries are trying hard and putting continuous efforts for competitive existence. For productivity improvements, manufacturing industries are coming up with different effective and efficient solutions in manufacturing processes and operations. To overcome current challenges, manufacturing industries have started using frugal approach in product design and services. For this study, methodology adopted with both primary and secondary sources of data. For primary source interview and observation technique is used and for secondary source review has done based on available literatures in website, printed magazines, manual etc. An attempt has made for understanding application of frugal approach with the study of manufacturing industry project. Manufacturing industry selected for this project study is Mahindra and Mahindra Ltd. This paper will help researcher to find the connections between the two concepts productivity improvement and frugal approach. This paper will help to understand significance of frugal approach for productivity improvement in manufacturing industry. This will also help to understand current scenario of frugal approach in manufacturing industry. In manufacturing industries various process are involved to deliver the final product. In the process of converting input in to output through manufacturing process productivity plays very critical role. Hence this study will help to evolve status of frugal approach in productivity improvement programme. The notion of frugal can be viewed as an approach towards productivity improvement in manufacturing industries.
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENTIAEME Publication
In this paper, we investigated a queuing model of fuzzy environment-based a multiple channel queuing model (M/M/C) ( /FCFS) and study its performance under realistic conditions. It applies a nonagonal fuzzy number to analyse the relevant performance of a multiple channel queuing model (M/M/C) ( /FCFS). Based on the sub interval average ranking method for nonagonal fuzzy number, we convert fuzzy number to crisp one. Numerical results reveal that the efficiency of this method. Intuitively, the fuzzy environment adapts well to a multiple channel queuing models (M/M/C) ( /FCFS) are very well.
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Opendatabay - Open Data Marketplace.pptxOpendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay AI-driven features streamline the data workflow. Finding the data you need shouldn't be a complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with a dedicated, AI-generated, synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. Marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
2. Analytical Implementation of Web Structure Mining using Data Analysis in Online Booking Domain
http://www.iaeme.com/IJARET/index.asp 581 editor@iaeme.com
1. INTRODUCTION
The Internet is a global network that is continually changing and unstructured. The Internet is
the world's largest information source. Web mining aims to extract relevant information from
the internet. It is an interdisciplinary field that includes data mining, machine learning, natural
language processing, analytics, databases, recovery of information, media, and other
techniques. The amount of information available on the Internet is vast and readily
available(Kumar 2010). Knowledge is gained not only through the contents of websites, but
also from the Web's distinctive features, such as its hyperlink architecture and range of contents.
Study of these features frequently shows fascinating trends and new knowledge that can be
useful in improving user efficiency, hence approaches for extracting data from the web are an
unexpected topic of research. These strategies aid in the extraction of information from the Web
data by utilizing at least one structural or procedure (Weblog) information in the mining
procedure.
The Web is a large, exploding, diversified, and mostly unstructured data source that supplies
a tremendous amount of information while simultaneously increasing the complexity of dealing
with it from the many views of resource providers, Web service providers, and industry
experts(Victor 2016). The following are regarded as Web mining challenges: The Internet is
vast, and web pages are the semi-structured; Web data has a wide range of meanings. The
measure of the quality of information extracted; Inference of understanding based on the facts
gathered. Web structure mining is being used to identify the theory behind the connection
architectures of the Web pages, catalogue, and produce knowledge such as comparison and
NLP), Machine Learning, etc(Anurag Kumar and Singh, n.d.). Web mining includes the usage
of data mining tools to identify and retrieve data from the World Wide Web automatically. Web
structure mining assists people in retrieving necessary documentation by analyzing the Web's
framework.
Web mining involves the standard data mining application methods to the Web. However,
the inherent features of the Web necessitate significant tailoring and extension of existing
approaches. To begin with, even though the Web includes a massive amount of material, it is
dispersed via the internet. When we begin mining, we must first obtain the Webpage(Suvarn
Sharma and Bhagat 2016). Second, because web pages are data that can be organized,
information must be retrieved and represented in some manner for easier processing. Third,
because Web content has a wide range of meanings, the training or testing data collection should
be sufficiently big(Martínez‐Torres et al. 2011).Despite the problems mentioned previously,
the Web also offers diverse strategies to assist mining, for the instance, between linkages.
Web pages were an essential reserve to be leveraged. Aside from the difficulty of finding
necessary details, users may encounter other challenges when communicating with a Web,
including degree of value on the evidence create, the acquisition of new knowledge from the
content on the web, customization of the data create, and acquiring knowledge about those
another users(Boddu et al. 2010). Web mining methods can be used to partially or overcome
the concerns listed above. Further more, web mining methods should not be the only methods
available for resolving these issues. Other research groups, including database, machine
learning, and retrieval of information, also are tackling the aforementioned challenges(Chopra,
n.d.). This circumstance makes it difficult to know what types of Web mining exist.
2. RELATED WORK
(TasnimSiddiqui and Aljahdali 2013)the web is the most effective channel of statement in the
modern business. Many of the businesses were rethinking about their company strategy to
increase output. Customers and clients can find their products and unique business on the
internet, which gives them the option to do business. When compared to a traditional workplace,
3. M. Xavier Rex
http://www.iaeme.com/IJARET/index.asp 582 editor@iaeme.com
online business eliminates the barriers of time and space. Large corporations all about the world
were discovering that the e-commerce is more than just marketing on the Internet. Rather, it
optimizes efficiency in strategies to succeed with the other market giants. Data mining, also
known as knowledge finding, is employed by the researchers. Web mining is a data mining
technology which is used on the WWW. The information of wealth on the web.
(Zubi, n.d.)With the Web's rapid development, users can easily become lost in its complex
hyperstructure. The primary objective of the administrators of these websites is to deliver useful
data to users to meet their demands. Web mining is among the ways that can assist website
owners in this regard.This method heavily relies on web structure mining. In the web structure
mining, two webpage ranking algorithms, PageRank and the Hyperlink-Induced Topic Search
(HITS) are extensively utilized. When awarding rank scores, all methods treat all connections
equally. This paper also included a comparison of the two methods.Ranking Web pages is a
significant priority since it helps users find highly rated pages thatwere appropriate to their
query. Various factors have been suggested to rank web content's highest level of efficiency,
and this paper also includes a brief review of the two most notable ones.
(Babu, Sathish, and Ashok 2011) web usage mining is a type of web mining that uses data
mining algorithms to extract the useful data from a World Wide Web users' navigational
activity. Web Usage Mining normally involves of three tasks: pre-processing, design analysis,
and information retrieval. Data pre - processing needs to clean the user's log file by deleting
system logs including such errors or failure and repeated requests for the same address from the
same address, among other things. The primary goal of pattern recognition is to filter out boring
data and to imagine and explain stimulating patterns for users. The statistics gathered from the
log file can aid in the discovery of information. This information can be utilized to make
decisions on numerous criteria such as Excellent, Medium, and Weak users, as well as
Excellent, Moderate, and Poor websites, depending on hit rates on the website. The website's
architecture is reconstructed depending on user behavior or hit numbers, which offers fast
response to internet users, saves computer amount of memory, and so reduces HTTP requests
and resource use. This study tackles issues in three stages of the Web Usage Mining as well as
a Web Structure Mining.
(Hosseinkhani, Chaprut, and Taherdoost, n.d.)Criminal online data regularly provides
unknown and useful information to security agencies. The digital data used in forensics
investigation include data more about offenders' social media. However, there is a difficult issue
in evaluating those pieces of data. It is attributed to the fact that a researcher must manually
process retrieve valuable knowledge from web page text but then make relationships between
different sets of data and categorize people into a structured database, after which the set seems
to be prepared to be examined using massive terrorist connectivity analytical techniques.It is
assumed that the manually organizing method of data analysis is inefficient since errors are
likely to occur. Furthermore, because the quality of the resulting analyzed data is dependent on
the investigator's experience and expertise, its trustworthiness is not continuous. The more
skilled an operator is, the better the outcome. The major goal of this research is to propose a
framework to solve the process of researching suspected criminals using forensic analysis of
data, which covers the dependability gap.
(Awasthi and Gupta 2019)Since the World Wide Web is a massive library that is developing
rapidly, users can gather data and travel among different sites on the Net. When surfing the
web, users are frequently unable to reach the lookout page. Web customization is a strategy
proposed to relieve users of the burden of information overload on the internet and to provide
them with necessary information based on their requirements. Web customization is a strategy,
traditional marketing, and artistic work. Personalization necessitates implicitly or explicitly
4. Analytical Implementation of Web Structure Mining using Data Analysis in Online Booking Domain
http://www.iaeme.com/IJARET/index.asp 583 editor@iaeme.com
gathering visitor data and leveraging that data into the digital distribution system to determine
what data give to visitors and present it.
3. PROPOSED METHODOLOGY
Web Mining
The WWW is massive and rapidly expanding. It has a tremendous amount of material that is
constantly developing and updating. Different companies, institutions, public bodies, and
service centres keep their knowledge up to date regularly. The web pages lack a basic pattern
and also have a complicated style. Furthermore, web pages are more complexly arranged than
traditional text texts(Anurag Kumar and Singh, n.d.). The WWW offers its services to a wide
range of online surfers. Users of the internet may have a wide range of interests, needs, and
experiences. When a user looks for information available on the internet, he or she is only
interested in a small part of the content(Boddu et al. 2010). The challenges listed above inspire
people to figure out how to use internet resources effectively and result in web mining. The
majority of scholars refer to web mining as any strategy that applies data gathering to web data.
Web mining was described as use of the data mining techniques to retrieve the knowledge
through web data. Web mining tasks are classified into three categories: web usage mining,
web content mining, and web structure mining.
Figure 1 Web Mining Process
Techniques of Web mining
Web mining is widely classified into three categories depend on the set of information also be
gathered, as seen in Figure 2: web content mining, web structure mining, and web usage mining.
Web Content Mining
It's the process of obtaining meaningful information from online document content. Web pages
can contain textual information, multimedia, images, and graphs(Dinucă and Ciobanu 2012).
Often, the information of web documents is semi-structured and unstructured style, making it
complicated to extract relevant information or knowledge. Multimedia big data and text
analytics are useful for mining the information of online locations.
Web Usage Mining
The use of analysis tools to select relevant and regular usage patterns in web logfiles. Web
usage mining is the use of data mining methods to uncover stimulating and regular access
behaviours in weblog data(Asadianfam, Kolivand, and Asadianfam 2020). The intriguing usage
5. M. Xavier Rex
http://www.iaeme.com/IJARET/index.asp 584 editor@iaeme.com
patterns and knowledge retrieved will be used for a wide range of applications such as system
enhancement, website alteration, including the use of storage and pre-fetching to enhance user
navigation and personalized web.
4. WEB STRUCTURE MINING (WSM)
This determined by the construction of web data. It contains XML (hypertext markup language)
connections and tags used in online pages. HTML hyperlinks are commonly used to connect
various pages. So, by researching these hyperlink relationships, certain usage information, such
as the value of a specific web page, can be discovered(Anurag Kumar and Singh, n.d.). If a web
address is connected to other websites, it can be regarded as important and positioned in a
higher-level category. The most well-known researcher in the field of web structure mining is
networking site analysis.
Overview of WSM
Web mining entails the following activities:
• The process of locating and retrieving desired Web documents is known as resource
discovery.
• Information identification and the pre-processing: picking and pre-processing specified
information from the recovered Web resources mechanically.
• Simplification identifies broad trends on particular Web sites and also across numerous
sites dynamically.
• The analysis involves the study of validating and/or interpreting the patterns that have
been mined.
Figure 2 Process of web structure mining
6. Analytical Implementation of Web Structure Mining using Data Analysis in Online Booking Domain
http://www.iaeme.com/IJARET/index.asp 585 editor@iaeme.com
The three types of Web mining based on their relationship, which takes advantage of their
hyperlink structure. Web Structure mining is connected to hyperlink research and the
techniques presented below. Although there are three types of Web mining, the distinctions
between them are becoming increasingly blurred as they are all linked.
The task of Web structure mining would be arrangement with an architecture of the
hyperlinks only within a Web itself. Currently showing an old field of investigate. However, as
attention in Web mining has grown, structural engineering field has developed, culminating in
the emergence of a new study field known as Link Mining, placed at the connection of effort
in network theory, hyperlink, and machine learning and information extraction, structural
learning, and machine learning and graph mining, as well as inductive programming
techniques(Mohan, Kurmi, and Kumar 2017). This new field of research has the potential to
have a variety of application areas, such as the Internet.
The Web comprises a wide range of objects with little in common in terms of publishing
style and substance, with disparities in authoring content and style being higher than in
conventional collections of textual information. Web pages are the objects of the WWW, and
links are in-, out-, and co-citation. HTML tags, word occurrences, and anchor texts are
examples of characteristics. Because it is not able to precisely employ conventional approaches
such as database administration or information retrieval, this variety of objects presents new
opportunities and challenges. Some typical data mining jobs had become agitated as a result of
link mining. Following is a list of some of the potential link mining jobs that may be applied to
Web structure mining.
Classification based on links
Link-based categorization has been the most current advancement of the traditional data mining
activity to connected contexts. The aim is to forecast the classification of a website depending
on words on a site, connection among the sites, anchor text, HTML tags, and the other relevant
properties detected on the site.
Cluster Analysis Using Links
Cluster research helps to identify found in nature sub-classes. The earlier task, connection-based
clustering was uncontrolled and could be utilizing to identify designs within information.
Type of Link
Predicting the presence of links involves a wide variety of tasks, such as anticipating a sort of
the connection between two objects or forecasting the function of a link.
Strength of Link
Weights could be related to links.
Cardinality of Link
Predicting the number of linkages between items is the major task here. There are numerous
ways to construct conceptions of authority using the Web's link construction. The main goal of
building link mining applications is to make full use of our considerate of a Web's fundamental
social order.
5. WEB DATA STRUCTURE
The traditional data gathering scheme is primarily concerned with the information included in
the content of Web documents. Web mining techniques provide more information via
hyperlinks that connect various pages. The Web can be regarded as a directed labelled graph,
7. M. Xavier Rex
http://www.iaeme.com/IJARET/index.asp 586 editor@iaeme.com
with nodes representing documents or pages and edges representing hyperlinks among them.
Web Graph is the name given to this directed graph representation on the Web(Kapusta, Munk,
and Drlik 2018).A graph G is made up of two groups, B and W. B is a limited and nonempty
set of the vertices. The set W is made up of the two vertices, which are referred to as edges. The
notations B(G) and W(G) indicate the sets of nodes and edges of graph G, correspondingly. To
depict a graph, G = (B, W) can also be used. Figure 3displays a focused graph with three nodes
and a three edges.
Figure 3 Directed Graph G
G's vertices 𝐵, 𝐵(𝐺) = {𝑋, 𝑌, 𝑍}. 𝑊(𝐺) = (𝑋, 𝑌), (𝑌, 𝑋), (𝑌, 𝑍) of G's edges. The greatest
size of the network in a graph structure with 𝑛 vertices is 𝑛. (𝑛 − 1). With three vertices, the
greatest number of vertices is 3(3 − 1) = 6. There is no connection between
(𝑍, 𝑌), (𝑋, 𝑍), 𝑎𝑛𝑑 (𝑍, 𝑌) in the preceding example (Z, X). A structure called is said to have
been continued effective if there is a responsibility to make from a to b and also from v to u for
any pair of different vertices 𝑎 and 𝑏 in 𝑏(𝐺). The graph in Fig. 3 above is not tightly associated
because there is a no route from the vertices Z to node Y. The Web can be visualized as a vast
graph with hundreds of millions or billions of the nodes or vertices and billions of arcs or edges.
The section that follows describes hyperlink research as well as the techniques that are utilized
in hyperlink research for information extraction. Furthermore, the information on a Web page
will be grouped in a binary tree depend on different HTML and XML tags here on the webpage.
Mining work in this area has been centred on autonomously removing document object model
(DOM) components from documents.
Figure 4 Web structure mining in online sales domain
Hyperlinks
A hyperlink is a structural and functional unit which links one point on a web page to another,
either on the same or a distinct web page. An intra-document hyperlink attaches to a distinct
8. Analytical Implementation of Web Structure Mining using Data Analysis in Online Booking Domain
http://www.iaeme.com/IJARET/index.asp 587 editor@iaeme.com
section of the same page, whereas a multi hyperlink links two separate pages. There has been a
substantial amount of work on hyperlink analytics that may be used to generate an up-to-date
survey.
Many Web Sites never include words that are indicative of their primary function, and some
Web Pages have very little language, making text-based search tools challenging. However,
illustrate this page may be helpful(Sunny Sharma and Rana 2017). This type of "categorization"
appears in the text that accompanies the hyperlink to a website. Many studies have been
conducted, and answers to the challenge of searching, indexing, or searching the Internet have
been proposed, taking into consideration its architecture and also the meta-information
contained in hyperlinks and the text accompanying it into consideration.
Based on the Network Analysis, several algorithms have been presented. Using citation
monitoring, the Co-citation method and the Extended Co-citation method were developed.
These methods are simplistic, and deeper correlations between a webpage cannot be detected.
Three major algorithms, Hypertext Induced Topic Search (HITS), Weighted PageRank
(WPR)and Page Rank are reviewed and contrasted in detail below.
HITS
Authorities and Computing Hubs The two types of pages from the Web hyperlink architecture
in HITS concept: authority and centers (Boddu et al. 2010). HITS will discover authority and
hubs for a given query. Hubs and authorities have a mutually supportive connection, as per "a
better hub would be a page which ideas to several better authorities; a better institution in a
page that is referred as among excellent hubs." Figure 2 shows an example. HITS connects a
non-negative authorization weight a<j> to a non-negative hub weight b<j>. Display on Figure
3.
Figure 5 Densely linked authorities and hubs set
Figure 6 HITS basic operations
9. M. Xavier Rex
http://www.iaeme.com/IJARET/index.asp 588 editor@iaeme.com
Although HITS gives strong search outcomes for the wide variety of queries, it is not
function well in all circumstances for the three reasons listed below.
• Relationships among hosts that are mutually reinforcing. Several documents in one
server may point on a single document on another host, or even one paper with one host
may point to some documents on another host. These circumstances may lead to
incorrect definitions of what constitutes a good centre or a strong authority.
• Links that are generated automatically. Links included by the tool are common in web
documents generated by tools.
• Nodes that aren't relevant. Occasionally pages link to certain other pages that have
nothing to do with the search subject.
Model of Page Rank
The Page Rank algorithm uses the web's framework to evaluate the significance of online sites.
Brin and Page's technique expands on the knowledge of objective counting in-links equally by
the normalizing by the number of links on a page. "The suppose page X contains be set from 0
and 1. d is commonly set to 0.85. The more information about d can be found in the pages
𝑇1. . . 𝑇𝑛 that link to it," says the Page Rank methodology. The d parameter is an adamping
factor that can followthe section. 𝐿(𝑄) is also calculated as the number of links leading away
from page A. A page's Page Rank is listed as follows:
𝑃𝑅(𝑋) = (1 − 𝑏) + 𝑏 (
𝑃𝑅(𝑆1)
𝐿(𝑆1)
+. . . +
𝑃𝑅(𝑆𝑛)
𝐿(𝑆𝑛)
) (1)
Because the Page Ranks create a probabilistic model over internet pages, "The sum of all
page Ranks on all websites would equals one." "The d damping factor is the possibility that the
"randomised surfer" would become tired and want a random new site at every page." The rank
of a page is evenly separated throughout its out-links, making a significant contribution to the
rankings of a pages toward which it connect(Sherlin, n.d.). It's a repetitive formula; however, it
may be computed by starting with any set of rankings and making improvements to the
calculation until it conforms. Page Rank, It correlates to a web's normalised link matrix's
primary eigenvector, may be determined using a basic adaptive approach. The Page Rank
system takes about an hour to determine m's ranking of the pages million.
Table 1 Classification of Internet domain
Domain Context
.int Usually used by "International" sites, such as
NATO sites.
.gov Commonly seen on US Government websites.
.com A very well and widely used Domain name,
which has been utilised for any form of webpage.
.edu Universities, for example, are educational
institutions.
.mil Used for military installations in the United
States.
.org Originally meant for non-profit "organizations," it
is currently utilized for a wide range of websites.
For a time, it was run by the Internet Society.
.net Originally meant for Internet-related sites, and
now used in a wide range of websites.
10. Analytical Implementation of Web Structure Mining using Data Analysis in Online Booking Domain
http://www.iaeme.com/IJARET/index.asp 589 editor@iaeme.com
Knowledge Discovery
The data gathered through the website can aid in the discovery of knowledge. This information
can be utilized to make decisions on a variety of issues, such as 1. The web pages with the most
hits will be a most popular. 2. What are the various user movement patterns? 3. The amount of
time expended on each online page, that indicates the value of the web page. 4. If the amount
of time expended on a particular online page is insignificant, this suggests that the new website
has no vital information. 5(Boddu et al. 2010). The absence of a user's query for a website page
suggests that the page must be changed. 6. If a log file entry constantly states "redirect" for a
certain web page, the website creative director must be contacted. Excellent websites will be
relocated extremely close to the main website, while middle-class websites will be relocated to
the next level. If the website owner and designer agree, the pages with the highest visit count
can be prioritized for placement closer to the home page. The heap tree can be constructed
depend on the hit counts recorded in the log file throughout a specific session(Akshi Kumar,
Dabas, and Hooda 2020). This heaps tree developed will assist us in making decisions regarding
the architecture of a webpage during the following intervals so the most popular internet pages
can be carried extremely close to the home/parent website page. Within this restructure, web
users will have faster admittance to websites while also making the most use of resources and
computer system memory.
Applications
PageRank is being utilized by Google in conjunction with other features such as keyword
phrases, IR measurements, and vicinity. HITS was initially employed in IBM's Clever web
browser, and PageRank is used by Google in conjunction with other characteristics including
an anchor text, IR metrics, and proximity. The concept of a honesty stems from the knowledge
that want to find not just a list of relevant pages, but the best possible set of relevant sections.
The Web, on the other hand, is made up of not only webpages and also connections that connect
them(Kanathey, Thakur, and Jaloree 2018). This structure offers a great deal of information
that should be taken advantage of. PageRank and HITS were ranking algorithms in which the
scores can be calculated as a reference value in a linear model. HITS and the PageRank are
utilized as beginning points for creative solutions, and all these two methodologies have some
expansions. There are a variety of additional link-based approaches that can be used on the
Internet. Link resources can be employed for clustering or classifying Web pages in addition to
weighing them. The theory is founded on the assumptions that (1) if page p1 links to page
𝑝2, 𝑝1 should have comparable content to 𝑝2, and (2) if 𝑝1 𝑎𝑛𝑑 𝑝2 are get co-cited by certain
mutual pages, 𝑝1 𝑎𝑛𝑑 𝑝2 should likewise have alike content. Regarding their reference and co-
citation qualities among some of the pages, web pages can be grouped into a variety of the
connected page groups.
As a result, ranking depending on the content of data can be enhanced. PageRank, HITS, as
well as another link-based algorithm, will be used to rank page sections. The fundamental ideas
are that: (1) important blocks have higher weighted links, (2) a component is conceptually
related to a page if it has a link grounded with it to the page, and (3) two pages are get similar
if they are co-cited through around prevalent characterized the important variables of a block
to respect towards its shape and location in the computer monitor when browsing. Their
findings show that block-level Page Rank and HITS can greatly increase the recognition rate.
11. M. Xavier Rex
http://www.iaeme.com/IJARET/index.asp 590 editor@iaeme.com
6. RESULT AND DISCUSSION
The following is the main technique for putting web structure mining into action:
1. Manual or automatic extraction of page rank.
2. Obtaining hyperlinks from a web page.
3. Domain classification on the internet.
4. Computation of major domain influences.
5. Determine the attributes of the URL.
Take a look at Figure 7 for an example of a hyperlink design for three pages X, Y, and Z.
Equation can be used to determine PageRank for pages X, Y, and Z.
Figure 7 Structure of Hyperlink for 3 pages
Let's start with a PageRank of 1.0 and work way down. d is set at 0.85 as a damping factor.
𝑃𝑅(𝑋) = (1 − 𝑑) + 𝑑 (
𝑃𝑅(𝐿)
𝐿(𝐿)
) = (1 − 0.85) + 0.85(
1
2
)
= 0.15 + 0.425 = 0.575 (2)
𝑃𝑅(𝑌) = (1 − 𝑑) + 𝑑((
𝑃𝑅(𝑋)
𝐿(𝑋)
+ (
𝑃𝑅(𝐿)
𝐿(𝐿)
)
= 0.819 (3)
𝑃𝑅(𝑍) = (1 − 𝑑) + 𝑑((
𝑃𝑅(𝑋)
𝐿(𝑋)
+ (
𝑃𝑅(𝑌)
𝐿(𝑌)
)
= 1.091 (4)
Take the following PageRank scores from the second iteration; (2), (3), (4).
𝑃𝑅(𝑋) = 0.15 + 0.85(1.091/2) = 0.614 (5)
𝑃𝑅(𝑌) = 0.15 + 0.85 ((
0.614
2
) + (
1.091
2
)) = 0.875 (6)
𝑃𝑅(𝑍) = 0.15 + 0.85 ((
0.614
2
) + (
0.875
1
)) = 1.155 (7)
The following PageRanks were obtained after several more iterations of the above
algorithm, as given in Table 2.
12. Analytical Implementation of Web Structure Mining using Data Analysis in Online Booking Domain
http://www.iaeme.com/IJARET/index.asp 591 editor@iaeme.com
Table 2 The iterative calculation for Page rank
Iteration PR(X) PR(Y) PR(Z)
0 1 1 1
1 0.576 0.818 1.092
2 0.614 0.875 1.155
… … … …
15 0.702 0.998 1.296
16 0.702 0.998 1.296
It is simple to compute and obtain the PageRank scores for such a small group of pages, but
it is more difficult to use it for a Web with billions of pages. As could see that PageRank of C
is greater than PageRank of B and A in Table 1 above. It's because, as illustrated in Fig. 8, Page
C contains two incoming and two outgoing links. There are two inbound connections and one
outgoing link on Page B. Since Page A only has one incoming connection and two outgoing
links, it has the weakest PageRank. Following iteration 15, the PageRank for the pages in Table
1 is normalized. PageRank has resolved to a suitable range, according to previous tests. Figure
8 depicts the convergence of PageRank computation for Table 1 as a graph.
Figure 8 PageRank Convergence Calculation
Weighted page rank Algorithm
Rather than assigning a lower rank value to the more significant pages, this approach assigns a
higher rank number to them. Each outgoing link is given an organization value to its
significance by distributing the rank value of the page equally between its outgoing connected
links. Both the backlink and the forward link are given equal weight in this algorithm. The
number of links pointing to a specific website is known as an incoming link, whereas the
number of links flowing out is known as an outgoing link. Due to the use of two factors named
backlink and the forward link, the technique is more effective than the search ranking algorithm.
The number of inbound and outbound links is noted, and the labels Win and W out are easily
assigned. The meaning is provided to incoming and outgoing links in terms of weight values.
Win (x, y) and W out (x, y) are the two terms for this. The weight of links (x, y) as stated in the
equation is Win (x, y). Finally, the evaluation is determined by the number of inbound links to
page y and the total number of inbound links to all of document x's reference pages.
𝑊(𝑥,𝑦)
𝑖𝑛
=
𝑙𝑦
∑ 𝑙𝑝
𝑃∈𝑅(𝑥)
(8)
0
0.2
0.4
0.6
0.8
1
1.2
1.4
1 2 3 4 5 6 7 8
Values
of
PageRank
Iteration
PR(X) PR(Y) PR(Z)
13. M. Xavier Rex
http://www.iaeme.com/IJARET/index.asp 592 editor@iaeme.com
The number of inbound links on page n is denoted by In. R(x) is the reference page of the
list page x, and lp - is the number of inbound links of page p. The weight of links (x, y) as
provided by the equation is W out(x, y). The final figure is derived from the number of outbound
links on page 𝑛 and the total number of outbound links on almost all of the page x's references
pages.
𝑊(𝑥,𝑦)
𝑜𝑢𝑡
=
𝑂𝑦
∑ 𝑂𝑝
𝑃∈𝑅(𝑥)
(9)
Page y's number of outbound links is Oy, while page p's number of outbound links is Op.
The weighted Page ranking would then be computed as follows:
𝑊𝑃𝑅(𝑦) = (1 − 𝑑) + 𝑑 ∑ 𝑊𝑃𝑅(𝑥)𝑊(𝑥,𝑦)
𝑖𝑛
𝑊(𝑥,𝑦)
𝑜𝑢𝑡
𝑥𝜖𝐵(𝑦)
(10)
HITS Algorithm
Hubs and authorities are the two different types of Web pages. Organizations are pages that
contain essential information. Hub pages serve as reference lists, directing users to authoritative
sources. As a result, a good hub page on a subject links to many trustworthy sheets on that topic,
and a better authority page links to many good hubs pages on the same topic. Figure 4 depicts
hubs and authorities, as well as their calculations. According to Kleinberg, a page can serve as
both a hub and an authority. This circular connection leads to the development of the HITS
evolutionary method (Hyperlink Induced Topic Search).
The HITS algorithm considers the WWW as a graphical model G (N, T), with N denoting
pages and T denoting links. The HITS algorithm it contains two basic phases. The sampling
stage is the first, and the recursive step is the second. In the step of sampling, a set of specific
pages for the provided enquiry is gathered, i.e., a sub-graph S of R with a high domain authority
page count is obtained.
𝐻𝑤 = ∑ 𝐴𝑠
𝑞∈𝑙(𝑔)
(11)
𝐴𝑤 = ∑ 𝐻𝑠
𝑞∈𝐵(𝑔)
(12)
Where𝐻𝑤means the hub weight, 𝐴𝑤 indicates the authority weight, 𝐼(𝑔) denotes the set of
references and referral pages on page 𝑤, and 𝐵(𝑔)represents the set of references and the
referral pages on page𝑤. The hub weight of a site is related to the total authority weights of the
pages it connects to. Figure 9 shows an example of how reputation and hub scores are
calculated.
Figure 9 Hubs and Authorities Calculation
14. Analytical Implementation of Web Structure Mining using Data Analysis in Online Booking Domain
http://www.iaeme.com/IJARET/index.asp 593 editor@iaeme.com
𝐴𝑤 = 𝐻𝑆1 + 𝐻𝑆2 + 𝐻𝑆3
𝐻𝑤 = 𝐴𝑅1 + 𝐴𝑅2 + 𝐴𝑅3
The following are the HITS constraints Algorithm:
• Hubs and authority: It's difficult to tell the difference between hubs and authority since
many websites serve as both hubs and authorities.
• Topic drift: Due to similar weighting, HITS will not always provide the most necessary
documentation to the user's searches.
• Automatically created links: HITS values automatically generated links equally, even if
they don't produce relevant topics for the user's query.
• Efficiency: In real-time, the HITS algorithm is inefficient.
7. CONCLUSION
Web Structure Mining is a useful tool for extracting data from previous user behaviour. Web
Structure Mining is a key component of this strategy. Web Structure Mining employs several
techniques to rank the relevant sites, all of which regard all links similarly when allocating the
ranking score.The significance of Web structure mining in terms of information retrieval is
discussed. Web mining is a study topic that focuses on web knowledge problems utilizing web
structure mining and has developed Link mining and block-level connection mining. We also
looked at two common algorithms, HITS, and PageRank, for this. Both practiced the relevance
of a webpage based on the web's hypertext links. The primary goal of this work is to investigate
the hyperlink organization and to grasp the Web graph directly. Web mining is the process of
obtaining data from the internet in the most efficient way possible. Web structure mining
generally involves many methods that lead to the retrieval of data from every website. In
general, web structure mining is the process of effectively retrieving data from a website for an
online user. It believes that, because this is such a broad topic with so much work to be done,
this paper will serve as a useful preliminary step for finding research programs.
REFERENCES
[1] Asadianfam, Shiva, Hoshang Kolivand, and Sima Asadianfam. 2020. “A New Approach for
Web Usage Mining Using Case Based Reasoning.” SN Applied Sciences 2 (7): 1–11.
[2] Awasthi, Satya Prakash, and Sandeep Gupta. 2019. “Analysis : Web Personalization
Association Via Web Mining Technique” 14 (10): 5.
[3] Babu, D Suresh, P Sathish, and J Ashok. 2011. “Fusion of Web Structure Mining and Web
Usage Mining” 2: 4.
[4] Boddu, Sekhar Babu, V.P Krishna Anne, Rajesekhara Rao Kurra, and Durgesh Kumar Mishra.
2010. “Knowledge Discovery and Retrieval on World Wide Web Using Web Structure
Mining.” In 2010 Fourth Asia International Conference on Mathematical/Analytical Modelling
and Computer Simulation, 532–37. Kota Kinabalu, Malaysia: IEEE.
https://doi.org/10.1109/AMS.2010.108.
[5] Chopra, Preeti. n.d. “A Survey on Improving the Efficiency of Different Web Structure Mining
Algorithms” 2 (3): 3.
[6] Dinucă, Claudia Elena, and Dumitru Ciobanu. 2012. “Web Content Mining.” Annals of the
University of Petroşani. Economics 12: 85–92.
15. M. Xavier Rex
http://www.iaeme.com/IJARET/index.asp 594 editor@iaeme.com
[7] Hosseinkhani, Javad, Suriayati Chaprut, and Hamed Taherdoost. n.d. “Criminal Network
Mining by Web Structure and Content Mining.” Advances in Remote Sensing, 6.
[8] Kanathey, Kavita, RS Thakur, and Shailesh Jaloree. 2018. “Ranking of Web Pages Using
Aggregation of Page Rank and Hits Algorithm.” International Journal of Advanced Studies in
Computers, Science and Engineering 7 (2): 17–22.
[9] Kapusta, Jozef, Michal Munk, and Martin Drlik. 2018. “Website Structure Improvement Based
on the Combination of Selected Web Structure and Web Usage Mining Methods.” International
Journal of Information Technology & Decision Making 17 (06): 1743–76.
[10] Kumar. 2010. “Web Structure Mining: Exploring Hyperlinks and Algorithms for Information
Retrieval.” American Journal of Applied Sciences 7 (6): 840–45.
https://doi.org/10.3844/ajassp.2010.840.845.
[11] Kumar, Akshi, Vikrant Dabas, and Parul Hooda. 2020. “Text Classification Algorithms for
Mining Unstructured Data: A SWOT Analysis.” International Journal of Information
Technology 12 (4): 1159–69.
[12] Kumar, Anurag, and Ravi Kumar Singh. n.d. “A Study on Web Structure Mining” 04 (1): 7.
[13] Martínez‐Torres, M.R., Sergio L. Toral, Beatriz Palacios, and Federico Barrero. 2011. “Web
Site Structure Mining Using Social Network Analysis.” Internet Research 21 (2): 104–23.
https://doi.org/10.1108/10662241111123711.
[14] Mohan, Kuber, Jitendra Kurmi, and Sanjay Kumar. 2017. “A Survey on Web Structure Mining.”
International Journal of Advanced Research in Computer Science 8 (3): 227–32.
[15] Sharma, Sunny, and Vijay Rana. 2017. “Web Personalization through Semantic Annotation
System.” Advances in Computational Sciences and Technology 10 (6): 1683–90.
[16] Sharma, Suvarn, and Amit Bhagat. 2016. “Data Preprocessing Algorithm for Web Structure
Mining.” In 2016 Fifth International Conference on Eco-Friendly Computing and
Communication Systems (ICECCS), 94–98. Bhopal: IEEE. https://doi.org/10.1109/Eco-
friendly.2016.7893249.
[17] Sherlin, Mrs C Clement. n.d. “A Comparative Analysis Of Web Structure Mining And Link
Based Object Ranking.”
[18] TasnimSiddiqui, Ahmad, and Sultan Aljahdali. 2013. “Web Mining Techniques in E-Commerce
Applications.” International Journal of Computer Applications 69 (8): 39–43.
https://doi.org/10.5120/11864-7648.
[19] Victor, Dr S P. 2016. “Analytical Implementation of Web Structure Mining Using Data Analysis
in Educational Domain” 11 (4): 5.
[20] Zubi, Zakaria Suliman. n.d. “Ranking WebPages Using Web Structure Mining Concepts.”
Signals and Systems, 8.