Web data mining is a current field of analysis that combines two research areas, data mining and the World Wide Web. Web data mining research draws on several disciplines, such as databases, artificial intelligence and information retrieval. Mining techniques fall into three categories: web content mining, web structure mining and web usage mining. This work analyzes these mining techniques. The analysis concludes that web content mining deals with unstructured or semi-structured data, whereas web structure mining deals with link structure and web usage mining mainly deals with user interaction.
In this research paper, we present an overview of research issues in web mining. We discuss mining with respect to web data, referred to here as web data mining. In particular, our focus is on web data mining research in the context of our web warehousing project. We have categorized web data mining into three areas: web content mining, web structure mining and web usage mining. We have highlighted and discussed various research issues involved in each of these web data mining categories. We believe that web data mining will be a topic of exploratory research in the near future.
Web Page Recommendation Using Web Mining - IJERA Editor
On the World Wide Web, content of many kinds is generated in huge amounts, so web recommendation has become an important part of web applications for giving users relevant results. Many kinds of web recommendation are made available to users every day, including images, video, audio, query suggestions and web pages. In this paper we aim to provide a framework for web page recommendation: (1) we first describe the basics and types of web mining; (2) we give details of each web mining technique; (3) we propose an architecture for personalized web page recommendation.
IJRET : International Journal of Research in Engineering and Technology is an international peer reviewed, online journal published by eSAT Publishing House for the enhancement of research in various disciplines of Engineering and Technology. The aim and scope of the journal is to provide an academic medium and an important reference for the advancement and dissemination of research results that support high-level learning, teaching and research in the fields of Engineering and Technology. We bring together Scientists, Academician, Field Engineers, Scholars and Students of related fields of Engineering and Technology
IDENTIFYING IMPORTANT FEATURES OF USERS TO IMPROVE PAGE RANKING ALGORITHMS - IJwest
The web is a wide, varied and dynamic environment in which different users publish their documents. Web mining is a data mining application in which web patterns are explored. Studies on web mining can be categorized into three classes: application mining, content mining and structure mining. Today the internet is of increasing significance, and search engines are considered an important tool for responding to users’ interactions. Among the algorithms used to find the pages users want is the PageRank algorithm, which ranks pages based on users’ interests. As the most widely used algorithm among search engines, including Google, it has proved its merit compared with similar algorithms; however, given the growth of the Internet and its increasing use, improving the performance of this algorithm is considered one of the necessities of web mining. The current study focuses on the Ant Colony algorithm and marks the most visited links based on a higher amount of pheromone. Results indicate that the proposed algorithm achieves high accuracy compared with previous methods. The Ant Colony algorithm, a swarm intelligence algorithm inspired by the social behavior of ants, can be effective in modeling the social behavior of web users. In addition, application mining and structure mining techniques can be used simultaneously to improve page ranking performance.
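For orientation, the baseline that this abstract sets out to improve can be sketched as a plain power-iteration PageRank over a link graph. This is the textbook link-based formulation, not the ant-colony variant the paper proposes; the function name `pagerank`, the damping factor of 0.85, the iteration count and the toy graph are all assumptions made for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Textbook PageRank by power iteration.

    links: dict mapping a page to the list of pages it links to.
    Returns a dict mapping each page to its rank; ranks sum to 1.
    """
    # Collect every page that appears as a source or as a target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    pages = sorted(pages)
    n = len(pages)

    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page receives the "random jump" share first.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Split this page's rank evenly over its out-links.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Dangling page (no out-links): spread its rank over all pages.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


# Hypothetical four-page graph used only to exercise the function.
toy_links = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(toy_links)
```

In this toy graph, page C collects links from A, B and D, so it ends up with the largest rank, while D, which nothing links to, keeps only the random-jump share.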
COST-SENSITIVE TOPICAL DATA ACQUISITION FROM THE WEB - IJDKP
The cost of acquiring training data instances for induction of data mining models is one of the main concerns in real-world problems. The web is a comprehensive source for many types of data which can be used for data mining tasks. But the distributed and dynamic nature of the web dictates the use of solutions which can handle these characteristics. In this paper, we introduce an automatic method for topical data acquisition from the web. We propose a new type of topical crawler that uses a hybrid link context extraction method for topical crawling to acquire on-topic web pages with minimum bandwidth usage and at the lowest cost. The new link context extraction method, called Block Text Window (BTW), combines a text window method with a block-based method and overcomes the challenges of each method using the advantages of the other. Experimental results show the predominance of BTW in comparison with state-of-the-art automatic topical web data acquisition methods based on standard metrics.
Multi Similarity Measure based Result Merging Strategies in Meta Search Engine - IDES Editor
Result merging is the key component of a meta search engine. Meta search engines provide a uniform query interface for Internet users to search for information. Depending on users’ needs, they select relevant sources and map user queries onto the target search engines, subsequently merging the results. The effectiveness of a meta search engine is closely related to the result merging algorithm it employs. In this paper, we propose a meta search engine with two distinct steps: (1) searching through surface and deep search engines, and (2) ranking the results with the designed ranking algorithm. Initially, the query given by the user is submitted to the deep and surface search engines. The proposed method uses two distinct algorithms for ranking the search results: a concept-similarity-based method and a cosine-similarity-based method. Once the results from the various search engines are ranked, the proposed meta search engine merges them into a single ranked list. Finally, experimentation is done to demonstrate the efficiency of the proposed visible and invisible web-based meta search engine in merging the relevant pages. TSAP is used as the evaluation criterion, and the algorithms are evaluated against it.
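The cosine-similarity ranking and merging step described above might look roughly like the following sketch. It scores each result snippet against the query with a bag-of-words cosine similarity and merges the lists from several engines into one ranked list. The tokenization by whitespace, the `(url, snippet)` result shape, the rule of keeping the best score for a duplicate URL, and the names `cosine_similarity` and `merge_results` are assumptions for illustration, not the paper's actual design.

```python
import math
from collections import Counter


def cosine_similarity(text_a, text_b):
    """Cosine similarity between two texts using raw term-frequency vectors."""
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    # Dot product over the terms the two texts share.
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def merge_results(query, result_lists):
    """Merge per-engine result lists into one list ranked by similarity.

    result_lists: iterable of lists of (url, snippet) pairs, one per engine.
    A URL returned by several engines keeps its best score (an assumption).
    """
    best = {}
    for results in result_lists:
        for url, snippet in results:
            score = cosine_similarity(query, snippet)
            if url not in best or score > best[url]:
                best[url] = score
    # Highest score first; ties broken by URL for a stable order.
    return sorted(best.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical results from a surface engine and a deep-web engine.
surface = [
    ("http://example.com/a", "web mining survey"),
    ("http://example.com/b", "cooking recipes"),
]
deep = [
    ("http://example.com/c", "deep web mining of hidden databases"),
    ("http://example.com/a", "web mining survey"),
]
merged = merge_results("web mining", [surface, deep])
```

Here the short, on-topic snippet for page `a` outranks the longer snippet for page `c`, and the off-topic page `b` falls to the bottom with a score of zero.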
A Novel Data Extraction and Alignment Method for Web Databases - IJMER
International Journal of Modern Engineering Research (IJMER) is a peer-reviewed online journal. It serves as an international archival forum of scholarly research related to engineering and science education.
International Journal of Modern Engineering Research (IJMER) covers all the fields of engineering and science: Electrical Engineering, Mechanical Engineering, Civil Engineering, Chemical Engineering, Computer Engineering, Agricultural Engineering, Aerospace Engineering, Thermodynamics, Structural Engineering, Control Engineering, Robotics, Mechatronics, Fluid Mechanics, Nanotechnology, Simulators, Web-based Learning, Remote Laboratories, Engineering Design Methods, Education Research, Students' Satisfaction and Motivation, Global Projects, and Assessment, and many more.
An Enhanced Approach for Detecting User's Behavior Applying Country-Wise Loca... - IJSRD
The development of the web in the past few years has created many challenges in this field. Recent work in this field concerns searching data using a tree-based search pattern. Various sequential mining algorithms have been developed to date. Web usage mining is used to process web server logs, which contain the navigation history of the user. The recommender system is explained along with its whole procedure. Searching the data in this way leads to proper and efficient search, but the problems were the time taken and the search results generated. So, a new local search algorithm is proposed for country-wise search that makes searching more efficient on the basis of local results. This approach has led to an advancement in search-based methods and the results generated.
Enhance Crawler For Efficiently Harvesting Deep Web Interfaces - rahulmonikasharma
The web is changing quickly and the volume of web resources is growing, so efficiency has become a challenging problem for crawling such data. Hidden web content is data that cannot be indexed by search engines because it stays behind searchable web interfaces. The proposed system aims to develop a framework for a focused crawler that efficiently gathers hidden web interfaces. First, the crawler performs site-based searching for center pages with the help of web search tools, to avoid visiting a large number of pages. To get more specific results for a focused crawl, the proposed crawler ranks websites, giving high priority to those more relevant to a given search. The crawler achieves fast in-site searching by looking for more relevant links with adaptive link ranking. We have also incorporated a spell checker to correct the input, and we apply reverse searching with incremental site prioritizing for wide-ranging coverage of hidden web sites.
Identifying the Number of Visitors to improve Website Usability from Educatio... - Editor IJCATR
Web usage mining deals with understanding visitors’ behaviour on a website. It helps in understanding concerns such as the present and future probability of every website user and the relationship between behaviour and website usability. Web mining has different branches, such as web content mining, web structure mining and web usage mining. The focus of this paper is on mining usage patterns from the web log data of an educational institution. There are three types of web-related log data, namely web access logs, error logs and proxy logs. In this paper, web access log data has been used as the dataset because it is the typical source of the navigational behaviour of website visitors. The study of web server log analysis is helpful in applying web mining techniques.
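As a rough illustration of how a web access log can yield visitor counts, the sketch below parses lines in the Common Log Format and counts distinct client hosts and per-page hits. Treating each distinct host as one visitor, counting only successful (2xx) requests, and the sample log lines are all simplifying assumptions; the paper's own preprocessing is not described in this abstract.

```python
import re
from collections import Counter

# Common Log Format: host ident user [time] "method path protocol" status size
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) (?P<size>\S+)'
)


def summarize_access_log(lines):
    """Return (number of distinct hosts, Counter of hits per path).

    Only requests with a 2xx status are counted; lines that do not match
    the Common Log Format are skipped.
    """
    visitors = set()
    page_hits = Counter()
    for line in lines:
        match = LOG_PATTERN.match(line)
        if match is None:
            continue  # malformed or differently formatted line
        if match.group("status").startswith("2"):
            visitors.add(match.group("host"))
            page_hits[match.group("path")] += 1
    return len(visitors), page_hits


# Hypothetical log lines used only to exercise the function.
sample_log = [
    '10.0.0.1 - - [01/Jan/2015:10:00:00 +0000] "GET /index.html HTTP/1.1" 200 1024',
    '10.0.0.2 - - [01/Jan/2015:10:01:00 +0000] "GET /courses.html HTTP/1.1" 200 2048',
    '10.0.0.1 - - [01/Jan/2015:10:02:00 +0000] "GET /courses.html HTTP/1.1" 200 2048',
    '10.0.0.3 - - [01/Jan/2015:10:03:00 +0000] "GET /missing.html HTTP/1.1" 404 0',
    'malformed line',
]
visitor_count, hits = summarize_access_log(sample_log)
```

A real study would usually go further, for example by removing robot traffic and grouping requests into sessions, since several people can share one host address.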
Comparison of Routing protocols in Wireless Sensor Networks: A Detailed Survey - theijes
The International Journal of Engineering & Science is aimed at providing a platform for researchers, engineers, scientists, or educators to publish their original research results, to exchange new ideas, to disseminate information in innovative designs, engineering experiences and technological skills. It is also the Journal's objective to promote engineering and technology education. All papers submitted to the Journal will be blind peer-reviewed. Only original articles will be published.
The papers for publication in The International Journal of Engineering & Science are selected through rigorous peer reviews to ensure originality, timeliness, relevance, and readability.
Theoretical work submitted to the Journal should be original in its motivation or modeling structure. Empirical analysis should be based on a theoretical framework and should be capable of replication. It is expected that all materials required for replication (including computer programs and data sets) should be available upon request to the authors.
The International Journal of Engineering & Science will take great care to publish your article without much delay, with your kind cooperation.
Stratigraphy and Lithology of Naokelekan Formation in Iraqi Kurdistan-Review - theijes
The study is based on analyses of slabs, thin sections and smear slides of 44 samples from 18 outcrops of the Naokelekan Formation in Iraqi Kurdistan. The study revealed that the age of the Naokelekan Formation is Callovian-Upper Oxfordian. The Cyclagelosphaera margerelii sp. indicates a restricted marine environment, while the Watznaueria barnesiae sp. points to a high-latitude location for the depositional basin, which had warm, low-nutrient water. The field observations and nannofossils revealed that the middle and upper parts of the Naokelekan Formation were either eroded or not deposited in uppermost northwestern Iraq.
On Integrability Of F-Structure Satisfying $F^{2K+1} + F = 0$ - theijes
The purpose of this paper is to study the integrability of the F-structure satisfying $F^{2K+1} + F = 0$, where K is a positive integer. The Nijenhuis tensor, metric F-structure and fundamental 2-form are also discussed.
Effect of Malting and Fermentation on the Proximate Composition and Sensory P... - theijes
Four maize flour samples comprising non-malted non-fermented maize (NMNFZ), non-malted fermented maize (NMFZ), malted non-fermented maize (MNFZ), malted fermented maize (MFZ) flour were blended with African yam bean flour to yield test flours consisting of NMNFZB, NMFZB, MNFZB and MFZB with 16g protein/100g flour each. Native maize flour was used as control. The test flours were used for production of tortilla designated as NMNFZBT, NMFZBT, MNFZBT and MFZBT respectively with NT (native tortilla) as control. Proximate composition and sensory attributes of the tortilla products were evaluated using standard methods. Malting and fermentation resulted in apparent increase in protein content of maize from 11.25g/100g solids (NMNFZ) to 11.67g/100g solids (MFZ). Complementation with African yam bean increased the protein content of the test flours. Crude protein values of the tortilla products ranged from 16.27g/100g solids (NMNFZBT) to 21.68g/100g solids (MFZBT). The MFZBT had the lowest carbohydrate content (59.17g/100g solids) while NMNFZBT had the highest value of 68.87g/100g solids. MFZBT had the highest values of 8.75, 1.35 and 5.77g/100g solids for moisture, fibre and ash contents respectively. NMNFZBT had the highest energy value of 1510.11kJ/100g. The flavour of the tortillas improved significantly (p<0.05) with MFZBT having the highest overall acceptability mean score (8.30±0.20).
Design of Multi Link Structure for Rear Suspension of a Heavy Vehicle - theijes
Automobile systems today are going through major changes, and where comfort is concerned, the suspension system and its working are very important. This paper discusses the study and dynamic analysis of a four-link suspension system. It addresses the design problem of vehicles using four-link suspension systems with the aim of optimizing vehicle handling and stability as a whole. Since this problem includes many evaluation items, and the four-link suspension system has interconnected behaviour, the optimization is complicated. An efficient and computable model is indispensable for carrying out the total optimization. This paper investigates the structure of the objectives and introduces appropriate simulation models for the respective items; we apply multibody dynamic analysis to plot various quantities such as wheel travel, camber angle, caster angle, toe-in and toe-out. The result of the optimization calculation shows the validity of the optimization model.
Different Solutions to a Mathematical Problem: A Case Study of Calculus 12 - theijes
An important duty of the mathematics teacher is to train and develop students' thinking. To accomplish this, teachers can organize creative problem-solving activities for students. In particular, an effective way to train students to think is to have them solve problems in many different ways. Based on this idea, we carried out an experiment in which grade 12 students calculated integrals in various ways. The results of the study showed that students were active in finding different solutions to the given problem.
Application of Very Low Frequency- Electromagnetic (VLF-EM) Method to Map Fra... - theijes
A geophysical survey using the very low frequency electromagnetic technique was applied to investigate possible geologic features such as fractured/conductive zones in Auchi and its environs in Edo State, Southwestern Nigeria. The study area is located within latitudes 7°05′N to 7°10′N and longitudes 6°11′E to 6°22′E. The geologic formations outcropping in the area are mainly Ajali and Nsukka. Three profiles were taken along the roads from Auchi to Igara, Auchi to Fugar and Auchi to Uloke using an Abem Wadi Terrameter. The profiles were plotted using computer software (Excel) and contoured using Surfer 10 to delineate the fractured/conductive zones. The values range from 0.3 to 22.5 Siemens. Areas of low conductivity indicate highly massive resistive rocks, while areas of high conductivity indicate the sedimentary terrain/host rock or mineralized zones. The area is sparsely fractured. Along profile A, two fractured zones were identified: one between 100 m (7.146°N, 6.195°E) and 400 m (7.150°N, 6.200°E) with conductivity values of 7.6 to 16.8 Siemens, and one from 420 m to 460 m with conductivity values of 11.0 to 22.5 Siemens. For profile B, one fractured zone was identified, along with a stretch of massive intrusive from 7.099°N and 7.102°N and 6.357°E to 6.364°E, with a conductivity range of 0.9 – 5.2 Siemens at points 400 m and 520 m – 1000 m. Profile C has identifiable fractured zones at 900 m – 1100 m with conductivity of 35 – 50 Siemens. The intrusive/host rock, with conductivity values of 0.3 – 8.7 Siemens, is located at 380 m to 880 m (7.156°N, 6.308°E) and 1100 m to 2000 m (7.148°N, 6.3295°E). A total of five conductive zones were observed.
Effect of Utilizing Geometer’s Sketchpad Software on Students’ Academic Achie... - theijes
The study was carried out to measure the effectiveness of the “Geometer’s Sketchpad” software in the classroom environment and to analyze how this training helps high school students solve mathematics problems. To measure the effectiveness, regression and correlation analyses were performed, and finally the mean responses were analyzed in the SPSS statistics program to evaluate the effectiveness of the method.
The Effect of Profitability on Firm Value in Manufacturing Company at Indones... - theijes
The purpose of the study was to analyze and explain the effect of profitability on firm value. The data used were secondary data obtained from manufacturing companies listed on the Indonesia Stock Exchange. The population of this research is manufacturing companies in the various industry sub-sectors listed on the Indonesia Stock Exchange. The study covers a period of six years, i.e. 2009 to 2014. The method of data analysis was path analysis, in which multiple regression equations are connected simultaneously, and the data were analyzed using the SmartPLS 2.0 software. The results of the data analysis show that profitability affects firm value: earning a profit justifies the payment of dividends, so the stock price increases because the company sends a positive signal by paying dividends.
Reliability Prediction of Port Harcourt Electricity Distribution Network Usin... - theijes
Interleaved High Step-Down Synchronous Converter - theijes
For low-output-voltage, high-output-current applications, synchronous switching power converters give better performance than non-synchronous converters. This paper presents an interleaved synchronous buck converter that has low switch voltage stress and a high conversion ratio. The input current is shared among the inductors, so high reliability and efficiency can be obtained, ripple is reduced, and converter performance is improved. The converter thus features automatic uniform current sharing among the interleaved phases without extra circuitry or complex control methods. Capacitor switching circuits are combined with an interleaved four-phase buck converter to obtain a high step-down conversion ratio without an extremely short duty ratio. Synchronous rectifier technology is adopted to increase the converter efficiency. A circuit with a 30 V input voltage and a 1.8 V output voltage is simulated to verify the performance. The simulation is done in MATLAB/SIMULINK R2012a.
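To see why an "extremely short duty ratio" is a concern at this operating point, consider a quick worked check under the textbook idealization of a conventional single-stage buck converter in continuous conduction. This is an assumed reference case for comparison, not the interleaved topology the paper proposes, whose conversion ratio is not given in this abstract.

```latex
V_{\text{out}} = D \, V_{\text{in}}
\quad\Rightarrow\quad
D = \frac{V_{\text{out}}}{V_{\text{in}}} = \frac{1.8\ \text{V}}{30\ \text{V}} = 0.06
```

Under that idealization, the switch would conduct for only 6% of each switching period, which is the kind of narrow on-time the combined capacitor-switching arrangement is said to avoid.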
Wind-induced Stress Analysis of Front Bumper (theijes)
At high velocities, such as on highways, the relative velocity between the oncoming wind and side winds is very high. The high-velocity winds that act on the bumper induce certain stresses on it. These stresses may cause deformation of the bumper; if this deformation exceeds a predesigned value, the functionality of the bumper may be hampered. This may result in safety issues and other design issues. In this paper, the effect and nature of these stresses have been quantified by conducting a wind-induced stress analysis on a model of the bumper. The bumper selected is that of the Jeep Wrangler and the modelling is done in Creo 2.2. The CFD simulation and structural analysis are conducted in Ansys Workbench 15. The structural analysis and fluid flow data are summarized along with the deformation and induced stress values.
Generation of Electricity through Speed Breaker Mechanism (theijes)
In the current scenario, the demand for power is increasing day by day with the growing population. At the same time, the energy crisis is a major issue, and conventional energy resources are running short because of their heavy use. This problem has to be addressed with a technique that not only overcomes the energy crisis but is also eco-friendly. Many conventional resources create pollution, which is why the focus is on eco-friendly solutions. This project puts forward the idea that power can be generated by specially designed speed breakers. A large amount of kinetic energy is wasted on roads every day in different forms; it could be used to generate power, and this power can be stored in batteries. The project shows how power can be generated using the rack-and-pinion method, in which linear motion is converted into rotary motion that is then used to generate electricity. A large amount of electricity can be generated using this method, and the method is eco-friendly.
Prevalence of Malaria Infection and Malaria Anaemia among Children Attending ... (theijes)
Malaria-associated anaemia represents a major public health problem. The study considered out-patient children aged 6 months to 15 years at the Emergency Paediatric Unit, Federal Medical Centre, Yola, from June to November 2015. Questionnaires were used to collect information on gender, age and the sociodemographic characteristics of parents/guardians. Microscopic examination of thick and thin blood films was employed, and Packed Cell Volume was used to screen for anaemia. Of the 168 children sampled, the prevalence of malaria infection and malaria anaemia was 29.2% and 26.2% respectively, and it was associated with P. falciparum. In relation to anaemia, children with mild anaemia (47.6%) had the highest malaria infection rate. Malaria infection was higher among males (32.2%) than females (25.6%); the age group 5-9 years (34.2%) had the highest malaria infection and the group ≥15 years (20.0%) the lowest, but the differences by gender and age were not statistically significant (p>0.05). Malaria infection was higher among children whose parents/guardians were unemployed (38.5%), had attended primary education (52.6%) and resided in a village setting (31.4%). For malaria anaemia, males (31.6%) and children aged 5-9 years (31.6%) recorded high prevalence rates, while children whose parents/guardians were civil servants (18.9%), had attended tertiary education (13.8%) and lived in quarters (11.1%) had the lowest prevalence rates. Children's gender and parents'/guardians' occupation and educational qualification were significantly associated with malaria anaemia (p<0.05). Therefore, sociodemographic factors of parents/guardians such as better occupation, higher educational qualification and a well laid-out and refined area of residence reduce the prevalence of malaria infection and malaria anaemia in children.
There is a need to sensitize the public on the importance of malaria management and the possible effects of malaria anaemia on children in order to circumvent the menace.
Failure Analysis of Feedstock Preheater Unit of the Kaduna Refinery using Fai... (theijes)
Failure modes, effects and criticality analysis (FMECA), used as a failure or reliability analysis tool, checks the probability that an item will perform a required function under stated condition(s) when operated properly. Failure analysis of process equipment is an important issue in any process industry. This study aims at analyzing the failure of the feedstock preheater unit of the Fluid Catalytic Cracking Unit (FCCU) of the Kaduna Refining and Petrochemicals (KRPC), using FMECA. The unit failure and its effects were identified through seven sub-units (fresh feed surge drum, heavy naphtha exchanger, light cycle oil exchanger, heavy cycle oil exchanger, fractionator bottom exchanger, feed preheater and fresh feed charge pump), using failure mode effects analysis (FMEA). Both quantitative and qualitative criticality analyses (CA) were used for failure analysis of the unit (feedstock preheater). For the qualitative analysis, the risk priority number (RPN) of each item was computed. Four sub-units (heavy naphtha exchanger, main fractionator bottom exchanger, feedstock preheater, and fresh feed charge pump) had an RPN greater than 200; these sub-units are said to be critical. Three sub-units (fresh feed surge drum, light cycle oil exchanger, and heavy cycle oil exchanger) had an RPN less than 200; these sub-units are said to be less critical. For the quantitative analysis, the criticality number (Cr) of each item was computed, and most of the sub-units had Cr>0.002. In addition, the criticality matrix showed that eight of the sixteen failure modes identified were above or closely below the criticality line. Finally, FMECA was effectively used for failure analysis of the feedstock preheater, and predictive maintenance was recommended.
Simulation of Deep-Drawing Process of Large Panels (theijes)
The article deals with the analysis of the formability of deep-drawing DC06 steel sheets. The aim of the investigations is to verify the formability of sheet metal with a thickness of 0.85 mm. The mechanical parameters of the sheets have been determined in uniaxial tensile and bulge tests. Numerical simulations using AUTOFORM have been carried out for two drawpiece models. The results obtained can be used during the simulation of a real forming process.
Research on Terminal Distribution Mode of Express System in University Town (theijes)
Terminal distribution is the key point in express delivery, because it affects service quality and customer perception. In a university town, huge numbers of online shopping orders lead to many problems in terminal distribution. At present, the feasible terminal express service patterns in a university town are traditional express service points, intelligent delivery lockers and express convenience stores. Taking Shanghai Songjiang university town as an example, the paper compares these patterns in detail in terms of initial investment costs, operating costs, customer preferences, customer convenience and business situation, and obtains an appropriate terminal distribution mode.
Comparative Investigation of Inter-Satellite Optical Wireless Communication By... (theijes)
Optical wireless communication systems have gained popularity in the past couple of years because of their benefits over conventional radio frequency communication systems. This paper reviews the effect of NRZ, RZ and Gaussian pulse generator modulation formats on the performance of the optical wireless communication (OWC) channel in terms of Quality factor and minimum BER at a bit rate of 10 Gbps. It has been observed that the NRZ pulse generator gives better performance for the optical wireless link than the RZ and Gaussian formats for different values of aperture diameter and range.
International Conference on Computer Science and Technology (anchalsinghdm)
ICGCET 2019 | 5th International Conference on Green Computing and Engineering Technologies. The conference will be held on 7th September - 9th September 2019 in Morocco. International Conference On Engineering Technology
The conference aims to promote the work of researchers, scientists, engineers and students from across the world on advancement in electronic and computer systems.
Web Usage Mining: A Survey on User's Navigation Pattern from Web Logs (ijsrd.com)
With the exponential growth of the World Wide Web, there is so much information overload that it has become hard to find data according to need. Web usage mining is a part of web mining that deals with the automatic discovery of user navigation patterns from web logs. This paper presents an overview of web mining and describes how navigation patterns are obtained with classification and clustering algorithms for web usage mining. Web usage mining comprises three important tasks, namely data preprocessing, pattern discovery and pattern analysis based on the discovered patterns. The paper also contains a comparative study of web mining techniques.
Recommendation generation by integrating sequential pattern mining and semantics (eSAT Journals)
Abstract: As Internet usage keeps increasing, the number of web sites and hence the number of web pages also keeps increasing. A recommendation system can be used to provide personalized web service by suggesting the pages that are likely to be accessed in future. Most recommendation systems are based on association rule mining or on keywords. With association rule mining the prediction rate is lower, as it does not take into account the order in which users access the web pages. Keyword-based recommendation systems provide less relevant results. This paper proposes a recommendation system that uses the advantages of sequential pattern mining and semantics over association rule mining and keyword-based systems respectively. Keywords: Sequential Pattern Mining, Taxonomy, Apriori-All, CS-Mine, Semantic, Clustering
A Survey on approaches of Web Mining in Varied Areas (inventionjournals)
There has been a lot of research in recent years on efficient web searching. Several papers have proposed algorithms based on user feedback sessions to evaluate the performance of inferring user search goals. When information is retrieved, the user clicks on a particular URL. Based on the click rate, ranking is done automatically by clustering the feedback sessions. Web search engines have made enormous contributions to the web and society. They make finding information on the web quick and easy. However, they are far from optimal. A major deficiency of generic search engines is that they follow the "one size fits all" model and are not adaptable to individual users.
The Web is a collection of inter-related files on one or more web servers, while web mining means extracting valuable information from web databases. Web mining is one of the data mining domains, in which data mining techniques are used to extract information from web servers. Web data includes web pages, web links, objects on the web and web logs. Web mining is used to understand customer behaviour and to evaluate a particular website based on the information stored in web log files. Web mining is carried out using data mining techniques, namely classification, clustering and association rules. Its application areas include electronic commerce, e-learning, e-government, e-policies, e-democracy, electronic business, security, crime investigation and digital libraries.
Retrieving the required web page efficiently and effectively is a challenging task because the web is made up of unstructured data, which delivers a large amount of information and increases the complexity of dealing with information from different web service providers. It becomes very hard to find, extract, filter or evaluate the relevant information for users. In this paper, we study the basic concepts of web mining, its classification, processes and issues, and we also analyse web mining research challenges.
Immunizing Image Classifiers Against Localized Adversary Attacks (gerogepatton)
This paper addresses the vulnerability of deep learning models, particularly convolutional neural networks (CNNs), to adversarial attacks and presents a proactive training technique designed to counter them. We introduce a novel volumization algorithm, which transforms 2D images into 3D volumetric representations. When combined with 3D convolution and deep curriculum learning optimization (CLO), it significantly improves the immunity of models against localized universal attacks by up to 40%. We evaluate our proposed approach using contemporary CNN architectures and the modified Canadian Institute for Advanced Research (CIFAR-10 and CIFAR-100) and ImageNet Large Scale Visual Recognition Challenge (ILSVRC12) datasets, showcasing accuracy improvements over previous techniques. The results indicate that the combination of the volumetric input and curriculum learning holds significant promise for mitigating adversarial attacks without necessitating adversary training.
Hierarchical Digital Twin of a Naval Power System (Kerry Sado)
A hierarchical digital twin of a Naval DC power system has been developed and experimentally verified. Similar to other state-of-the-art digital twins, this technology creates a digital replica of the physical system executed in real-time or faster, which can modify hardware controls. However, its advantage stems from distributing computational efforts by utilizing a hierarchical structure composed of lower-level digital twin blocks and a higher-level system digital twin. Each digital twin block is associated with a physical subsystem of the hardware and communicates with a singular system digital twin, which creates a system-level response. By extracting information from each level of the hierarchy, power system controls of the hardware were reconfigured autonomously. This hierarchical digital twin development offers several advantages over other digital twins, particularly in the field of naval power systems. The hierarchical structure allows for greater computational efficiency and scalability while the ability to autonomously reconfigure hardware controls offers increased flexibility and responsiveness. The hierarchical decomposition and models utilized were well aligned with the physical twin, as indicated by the maximum deviations between the developed digital twin hierarchy and the hardware.
Water scarcity is the lack of fresh water resources to meet the standard water demand. There are two types of water scarcity: physical water scarcity and economic water scarcity.
Student information management system project report ii.pdf (Kamal Acharya)
Our project explains about the student management. This project mainly explains the various actions related to student details. This project shows some ease in adding, editing and deleting the student details. It also provides a less time consuming process for viewing, adding, editing and deleting the marks of the students.
Final project report on grocery store management system..pdf (Kamal Acharya)
In today’s fast-changing business environment, it’s extremely important to be able to respond to client needs in the most effective and timely manner. If your customers wish to see your business online and have instant access to your products or services.
Online Grocery Store is an e-commerce website that retails various grocery products. This project allows users to view the products available and enables registered users to purchase the desired products instantly using the Paytm or UPI payment processor (Instant Pay), or to place an order using the Cash on Delivery (Pay Later) option. It also gives Administrators and Managers easy access to the orders placed using the Pay Later and Instant Pay options.
In order to develop an e-commerce website, a number of technologies must be studied and understood. These include multi-tiered architecture, server and client-side scripting techniques, implementation technologies, programming languages (such as PHP, HTML, CSS, JavaScript) and MySQL relational databases. The objective of this project is to develop a basic shopping cart website for consumers and to learn about the technologies used to develop such a website.
This document will discuss each of the underlying technologies used to create and implement an e-commerce website.
The International Journal Of Engineering And Science (IJES)
|| Volume || 5 || Issue || 5 || Pages || PP -27-31|| 2016 ||
ISSN (e): 2319 – 1813 ISSN (p): 2319 – 1805
www.theijes.com The IJES Page 27
Comparable Analysis of Web Mining Categories
Anmol Kaur¹, Dr. Raman Maini²
¹ M.Tech Student, Department of Computer Engineering, Punjabi University, Patiala, India
² Professor, Department of Computer Engineering, Punjabi University, Patiala, India
----------------------------------------------------------ABSTRACT-----------------------------------------------------------
Web Data Mining is a current field of analysis that combines two research areas, Data Mining and the
World Wide Web. Web Data Mining research draws on several research disciplines such as Databases,
Artificial Intelligence and Information Retrieval. The mining techniques are categorized into Web Content
Mining, Web Structure Mining and Web Usage Mining. In this work, an analysis of these mining techniques
is carried out. The analysis concludes that Web Content Mining has an unstructured or semi-structured
view of data, whereas Web Structure Mining deals with link structure and Web Usage Mining mainly
covers interaction.
---------------------------------------------------------------------------------------------------------------------------------------
Date of Submission: 18 April 2016 Date of Acceptance: 05 May 2016
---------------------------------------------------------------------------------------------------------------------------------------
I. Introduction
Knowledge can be spread through the World Wide Web. The Internet has become an important part of today's
busy schedule and has changed the way business, education and organisations are run. The Web is a huge
and dynamic collection of information, which makes this abundant data increasingly complex to handle.
User satisfaction is nevertheless essential, because users want a precise answer on the topic they search
for. Different users have different needs and levels of satisfaction according to their field: students want
answers on their topics of study, whereas business-minded people want to analyse customers' requirements.
Everyone wants techniques that meet their needs. Data Mining tools can be applied to find the required
knowledge on the Internet. The gathered information is used to gain more control, to observe the
information and to forecast which option is right and which approach is appropriate for moving ahead [3].
According to studies, Web Mining can be categorized into three types, namely Web Content Mining, Web
Structure Mining and Web Usage Mining. Web mining is decomposed into the following subtasks:
1. Resource Discovery: finds web information from various sources.
2. Information selection and pre-processing: the data collected from the web is automatically selected and
pre-processed.
3. Generalization: patterns are automatically discovered both at individual sites and across multiple sites.
4. Analysis: validates the data that has been mined [7].
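The four subtasks above can be sketched as a small pipeline. This is only an illustrative sketch under stated assumptions, not a method from the cited work: the in-memory `PAGES` dictionary stands in for real resource discovery, and the function names (`discover`, `preprocess`, `generalize`, `analyze`, `mine`) and the `min_pages` threshold are hypothetical.

```python
import re
from collections import Counter

# Hypothetical in-memory "web": URL -> raw HTML (a stand-in for real crawling).
PAGES = {
    "http://example.org/a": "<html><body><h1>Data Mining</h1>"
                            "<p>Web mining uses data mining.</p></body></html>",
    "http://example.org/b": "<html><body><p>Web usage mining studies web logs.</p>"
                            "</body></html>",
}

def discover(pages):
    """1. Resource discovery: return the documents to be mined."""
    return list(pages.items())

def preprocess(html):
    """2. Selection and pre-processing: strip tags, lowercase, tokenize."""
    text = re.sub(r"<[^>]+>", " ", html)
    return re.findall(r"[a-z]+", text.lower())

def generalize(token_lists):
    """3. Generalization: count in how many pages each term occurs."""
    doc_freq = Counter()
    for tokens in token_lists:
        doc_freq.update(set(tokens))  # one vote per page, not per occurrence
    return doc_freq

def analyze(doc_freq, min_pages):
    """4. Analysis: keep only terms supported by at least min_pages pages."""
    return sorted(term for term, n in doc_freq.items() if n >= min_pages)

def mine(pages, min_pages=2):
    docs = discover(pages)
    token_lists = [preprocess(html) for _, html in docs]
    return analyze(generalize(token_lists), min_pages)

if __name__ == "__main__":
    print(mine(PAGES))  # prints ['mining', 'web']
```

Here the "pattern" is simply a term shared by several pages; a real system would substitute classification, clustering or association rules at the generalization step.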
II. Different Categories of Web Data Mining
This section describes the different types of Web Data Mining, namely Web Content Mining, Web Structure
Mining and Web Usage Mining. This categorization is depicted in Figure 1.
Fig 1. Web Mining Categorization [3]
2.1. Web Content Mining
Web Content Mining is the automatic exploration of the information available on the web. It examines
text, images and graphs to discover relevant material, much of which is semi-structured or unstructured
in nature. This examination takes place after the pages have been collected through structure mining, and
it produces results according to their level of relevance. Given the bulky load present on the web, this
mining returns results in order of priority [2]. Web Content Mining is aimed at the particular data
specified by the client's search in the various search engines. It allows the whole Web to be scanned to
gather clusters of content, which triggers the scanning of the specific Web pages within those clusters.
The resulting pages are returned to search engines ordered from the highest to the lowest level of
relevance. This mining helps to minimize irrelevant data [2;3].
Content mining is effective when it is used with a particular database. For example, online universities
use a library system that helps retrieve articles relevant to their fields of study. Such a database returns
only the information related to the subjects concerned. The advantage of this mining is that it categorizes,
organizes and provides the best possible results available on the Internet. Web mining helps to increase
productive uses of mining for business, web design and search engine operations [2].
Web content mining, or text mining, can be viewed from two perspectives: the Information Retrieval view and
the Database view. In the Information Retrieval view, unstructured documents can be represented as a bag of
words. This representation ignores the order in which words occur. Each feature can be Boolean (i.e. whether
or not the word occurs in the document) or frequency based (i.e. how many times the word occurs). Features
can be selected using measures such as cross entropy, mutual information or information gain [2]. Other
techniques include Latent Semantic Indexing, which maps the original documents into a lower-dimensional
space by analysing the correlational structure of the document collection, so that similar documents that do not
share terms are placed in the same category, and stemming, which reduces words to their morphological roots:
for example, “collection”, “collect”, “collected” and “collecting” are all stemmed to the common root “collect”,
and only that root is used as a feature instead of the four original words [6]. The Database view, on the other
hand, mainly aims to transform unstructured data into structured data using related data mining techniques.
Multilevel Database: In this approach, low-level unstructured data such as HTML documents is stored in
several web databases, while generalizations are built at the upper levels, which results in an organised
structure [4].
Web Query Systems: Some web query systems, e.g. W3QL, use a database query language such as SQL [4].
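The bag-of-words representation and the stemming step described above can be sketched as follows. This is a minimal illustration only: the suffix list, the function names and the sample sentence are assumptions made for the example, and a real system would use an established stemmer such as the Porter algorithm rather than this toy suffix stripper.

```python
import re
from collections import Counter

# Toy suffix list, assumed for illustration only; it is not a real stemmer.
SUFFIXES = ("ing", "ion", "ed", "s")

def stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

def tokenize(text):
    """Lower-case the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())

def bag_of_words(text, vocabulary):
    """Return (boolean_features, frequency_features) over a fixed vocabulary.

    Word order is discarded: only the stemmed term counts are kept.
    """
    counts = Counter(stem(token) for token in tokenize(text))
    frequency = [counts[term] for term in vocabulary]
    boolean = [count > 0 for count in frequency]
    return boolean, frequency

# Hypothetical document and vocabulary.
document = "Collecting a collection: she collected what others collect."
vocabulary = ["collect", "other", "web"]
boolean, frequency = bag_of_words(document, vocabulary)
print(boolean)    # one True/False flag per vocabulary term
print(frequency)  # one occurrence count per vocabulary term
```

Because all four word forms map to the root “collect”, they contribute to a single feature, which is the effect of stemming described in the text.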
2.2. Web Structure Mining
Web structure mining identifies the structural summary of web pages and web sites. This structural
information can be discovered because database techniques are available for web pages. Whereas Web Content
Mining generally deals with the structure inside a document, Web Structure Mining focuses mainly on the
hyperlink structure at the inter-document level. It describes how web pages are related and produces related
information about a specific topic. Structure mining mitigates two major problems of the web that arise from
the abundance of available data [2]. These two problems are as follows:
1. Irrelevant search results: search results are distorted because of the precision criteria that search engines
allow.
2. Abundance of data: the second problem is indexing the large quantity of data available on the internet.
Web structure mining reduces both problems in part by discovering the model underlying the web hyperlink
structure [3].
Structure mining extracts previously unknown relationships between web pages. It lets an organisation use the
link knowledge of its own website to improve navigation and to group data into site maps. Relevant information
can then be reached through keywords. The hyperlink hierarchy is also analysed to relate information within a
site to competitor links and to connections made through third-party co-links and search engines. Web Structure
Mining further helps to identify web pages with similar structure by means of clustering. The more web crawlers
there are, the more useful the results for a related search [5].
If web pages are directly linked to one another, or are neighbours, a relation between them can be found. The
relation may be ontological, or the pages may have similar content. Web Structure Mining also makes it possible
to generalize the sequences or networks of hyperlinks in the web sites of a particular domain. This reveals how
information flows through the sites, which in turn makes query processing easier and more efficient [1]. During
1997-1998, the two most influential hyperlink-based search algorithms, PageRank and HITS, were introduced.
HITS was later improved by adding content information to the link structure and by using outlier filtering.
These methods are mainly used to calculate a quality rank for each web page.
Hyperlinks are mainly useful for the following [1]:
Exploring the actual pages that are linked.
Suggesting pages with authority on the same subject as the page containing the link.
Fig 2. Web Graph Structure [6]
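To make the idea of a link-based quality rank concrete, the following is a minimal sketch of PageRank computed by power iteration over a small web graph. The graph, the function name, the damping factor of 0.85 and the iteration count are assumptions chosen for the example; the sketch is not the implementation used by any particular search engine.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Rank pages by power iteration.

    links maps each page to the list of pages it links to.
    Returns a dict mapping each page to its rank; ranks sum to 1.
    """
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        # Every page receives a base share from the random-jump term.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page passes its rank equally to the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Dangling page: spread its rank evenly over all pages.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank

# Hypothetical four-page web graph.
web_graph = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(web_graph)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

In this example page C receives links from three other pages and ends up with the highest rank, while page D, which no page links to, keeps only the base share.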
2.3. Web Usage Mining
Web Usage Mining is the third category. It allows information about accesses to web pages to be gathered.
This information is collected automatically in access logs. Most organisations collect daily logs that record, via
CGI scripts, who accessed the internet, for how long, and which sites were visited. By examining these logs,
organisations can assess promotional campaigns and customer lifetime value [3], for example for online sales
advertising on the web. Usage mining can also determine the best access paths to services [3]. Most existing
tools report information about user logs. With such tools it is easy to find out how many times a user visited a
particular site, the domain name and the URLs the user accessed. However, these tools can handle only low to
moderate traffic. More advanced systems are designed to discover and analyse patterns. These tools fall into the
following two types:
1. Pattern Discovery Tools: These emerging tools draw on techniques from data mining, psychology,
information theory and artificial intelligence to extract knowledge from a pool of information. An example
is the WEBMINER system, which automatically discovers association rules and sequential patterns from
access logs [4].
2. Pattern Analysis Tools: Once patterns have been discovered, analysts need suitable tools to visualize and
interpret them, for example using OLAP techniques [4]. An example is WebSIFT (Web Site Information
Filter System), a framework for web usage mining that uses structure information and content information
to find results. This is also a fertile research area [3]. Along with creating server sessions, WebSIFT
performs content and structure pre-processing. Sequential pattern analysis and association rule discovery
are then performed on the session files [3].
Fig 3. Web Usage Mining Process [4]
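The usage mining process can be illustrated with a small sketch that pre-processes access log entries into sessions and then discovers association rules between pages. Several things here are assumptions made for the example: the log is taken to be in Common Log Format, the sample entries are invented, each client host is treated as one session, and the support and confidence thresholds are arbitrary. It is not a reconstruction of WEBMINER or WebSIFT.

```python
import re
from collections import Counter, defaultdict
from itertools import combinations

# Common Log Format is assumed; only the host and the requested URL are kept.
LOG_PATTERN = re.compile(
    r'^(\S+) \S+ \S+ \[[^\]]+\] "(?:GET|POST) (\S+) [^"]*" \d{3} \S+$'
)

# Invented sample entries for illustration.
SAMPLE_LOG = """\
10.0.0.1 - - [01/Jan/2000:10:00:00 +0000] "GET /home HTTP/1.0" 200 512
10.0.0.1 - - [01/Jan/2000:10:00:05 +0000] "GET /products HTTP/1.0" 200 2048
10.0.0.1 - - [01/Jan/2000:10:00:09 +0000] "GET /cart HTTP/1.0" 200 128
10.0.0.2 - - [01/Jan/2000:10:01:00 +0000] "GET /home HTTP/1.0" 200 512
10.0.0.2 - - [01/Jan/2000:10:01:07 +0000] "GET /products HTTP/1.0" 200 2048
10.0.0.3 - - [01/Jan/2000:10:02:00 +0000] "GET /home HTTP/1.0" 200 512
"""

def build_sessions(log_text):
    """Group requested URLs by client host (one session per host, a simplification)."""
    sessions = defaultdict(set)
    for line in log_text.splitlines():
        match = LOG_PATTERN.match(line)
        if match:
            host, url = match.groups()
            sessions[host].add(url)
    return dict(sessions)

def association_rules(sessions, min_support=0.5, min_confidence=0.6):
    """Return sorted rules (antecedent, consequent, support, confidence) for page pairs."""
    total = len(sessions)
    page_counts = Counter()
    pair_counts = Counter()
    for pages in sessions.values():
        page_counts.update(pages)
        pair_counts.update(combinations(sorted(pages), 2))
    rules = []
    for (a, b), count in pair_counts.items():
        support = count / total  # fraction of sessions containing both pages
        if support < min_support:
            continue
        for lhs, rhs in ((a, b), (b, a)):
            confidence = count / page_counts[lhs]  # estimate of P(rhs | lhs)
            if confidence >= min_confidence:
                rules.append((lhs, rhs, support, confidence))
    return sorted(rules)

sessions = build_sessions(SAMPLE_LOG)
rules = association_rules(sessions)
for lhs, rhs, support, confidence in rules:
    print(f"{lhs} -> {rhs}  support={support:.2f}  confidence={confidence:.2f}")
```

A production system would also split sessions on inactivity timeouts and identify users more carefully than by host address; those steps are left out to keep the sketch short.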
III. Comparison of the Different Categories of Web Data Mining
Aspect | Web Content Mining | Web Structure Mining | Web Usage Mining
Objective | Generally used to discover knowledge and collections of documents such as images and videos | Mainly the structure of the links can be examined and desirable documents can be determined | The behaviour of users can be determined during their interaction with the World Wide Web
Technology Used | Machine learning and automatic extraction | Information can be accessed through a reference schema using database technology | Association, clustering, classification
Data view point | Unstructured or semi-structured | Linked composition | Associated
Application | Information extraction, segmenting web pages and detecting noise | Used in business to determine the links among various sites | Used in online auctions, E-Banking, E-Commerce transactions, E-Commerce customer behaviour analysis
Table 1. Web Mining Categories [1]
IV. Conclusion
Web Data Mining provides a way to categorize the information required. It encourages the development of new
tools for extracting valuable information from web sources, and it yields relevant results for queries issued on
the World Wide Web. In this work, a comparative analysis of the mining techniques has been carried out. It has
been concluded that Web Content Mining works with an unstructured or semi-structured view of data and
covers the automatic discovery of information accessible from the web, whereas Web Structure Mining works
with the link structure at the inter-document level and helps to solve the problem of irrelevant results. The major
use of web structure mining is to find hidden relationships between web pages. Web Usage Mining, on the other
hand, is mainly concerned with user interaction and provides the details recorded in web access logs. Existing
tools such as WEBMINER help to draw conclusions from various web logs. Many fields could benefit from
these mining categories. Web Usage Mining applications such as E-banking and E-Commerce customer
analysis, and the problems they raise, require further research.
References
[1] Jaideep Srivastava, Robert Cooley, Mukund Deshpande, Pang-Ning Tan, “Web Usage Mining: Discovery and Applications of
Usage Patterns from Web Data”, SIGKDD Explorations, Volume 1, Issue 2, Jan 2000.
[2] Raymond Kosala and Hendrik Blockeel, “Web Mining Research: A Survey”, ACM SIGKDD, July 2000.
[3] Yan Wang, “Web Mining and Knowledge Discovery of Usage Patterns”, CS 748T Project (Part I), February 2000.
[4] R. Cooley, B. Mobasher, and J. Srivastava, “Web Mining: Information and Pattern Discovery on the World Wide Web”, Ninth
IEEE International Conference, IEEE, Nov 1997.
[5] Robert Cooley, Bamshad Mobasher, and Jaideep Srivastava, “Data Preparation for Mining World Wide Web Browsing
Patterns”, Supported by NSF Grant, Oct 1998.
[6] R. Kosala, H. Blockeel, “Web Mining Research: A Survey”, in SIGKDD Explorations 2(1), ACM, July 2000.
[7] B. Masand, M. Spiliopoulou, J. Srivastava, O. Zaiane, ed., Proceedings of “WebKDD2002 – Web Mining for Usage Patterns and
User Profiles”, Edmonton, CA, 2002.
[8] R. Kohavi, “Mining E-Commerce Data: The Good, the Bad, the Ugly”, Invited Industrial presentation at the ACM SIGKDD
Conference, San Francisco, CA, 2001.
[9] M. Spiliopoulou, “Data Mining for the Web”, Proceedings of the Symposium on Principles of Knowledge Discovery in
Databases (PKDD), 1999.
Web References
[1] https://sites.google.com/site/assignmentssolved/mca/semester6/mc0088/14
[2] http://www.web-datamining.net/
[3] https://www.google.co.in/