This document summarizes a research project that aims to build a predictive model to classify business web pages and reduce the volume of pages that need to be crawled. The goal is to predict which pages are most likely to contain key information like a business address or leadership team members. Currently, companies must crawl all pages on a website to extract this data, which is costly.
The research will use machine learning algorithms like logistic regression, random forest and Naive Bayes on a dataset of over 11,000 web pages. Features will include information extracted from the URL text and page content. Models will be built separately to classify pages for address information and executive information since these are separate prediction tasks. Performance will be evaluated using AUC since the data
International Journal of Engineering Research and Development (IJERD)IJERD Editor
call for paper 2012, hard copy of journal, research paper publishing, where to publish research paper,
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJERD, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, reserach and review articles, IJERD Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathemetics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer reviw journal, indexed journal, reserach and review articles, engineering journal, www.ijerd.com, research journals,
yahoo journals, bing journals, International Journal of Engineering Research and Development, google journals, hard copy of journal
RECOMMENDATION FOR WEB SERVICE COMPOSITION BY MINING USAGE LOGSIJDKP
Web service composition has been one of the most researched topics of the past decade. Novel methods of
web service composition are being proposed in the literature include Semantics-based composition, WSDLbased
composition. Although these methods provide promising results for composition, search and
discovery of web service based on QoS parameter of network and semantics or ontology associated with
WSDL, they do not address composition based on usage of web service. Web Service usage logs capture
time series data of web service invocation by business objects, which innately captures patterns or
workflows associated with business operations. Web service composition based on such patterns and
workflows can greatly streamline the business operations. In this research work, we try to explore and
implement methods of mining web service usage logs. Main objectives include Identifying usage
association of services. Linking one service invocation with other, Evaluation of the causal relationship
between associations of services.
This document proposes a new advanced e-commerce recommendation system that incorporates additional data beyond a user's purchase history to provide improved recommendations. Specifically, it suggests combining a user's search and purchase history with data from social media platforms like Twitter, reviews from sites like Amazon and Snapdeal, the user's location and preferences. This additional data could allow the system to recommend items a user has not previously purchased. It also describes some limitations of existing recommendation systems that only consider purchase history or require clustering users. The proposed system would use clustering and collaborative filtering together to group similar products and then make recommendations based on the evaluations within each cluster to improve accuracy.
Web usage Mining Based on Request Dependency GraphIRJET Journal
This document discusses using request dependency graphs (RDGs) to model the dependency relationships between HTTP requests for web usage mining. RDGs can improve data quality and enhance network and web server performance. The authors evaluated their approach using a large real-world web access log and found that RDGs are a useful tool for web usage mining by extracting patterns from user access behaviors and decomposing websites.
Calculating Rank of Web Documents Using Its Content and Link AnalysisIRJET Journal
This document proposes a method for calculating the rank of web documents using both content analysis and link analysis to improve search results. It discusses how current search engines use page ranking based only on hyperlinks. The proposed method first fetches links and content from the top search results for a query. It creates a dictionary to evaluate a score for each document based on how many times the query terms appear. It also considers synonyms from WordNet. The scores are then used to reorder the results to surface the most relevant pages. The algorithm is demonstrated on sample queries. Experimental results show the original and reordered rankings. The goal is to reduce irrelevant results at the top by incorporating both link and content information.
1. The document describes a search engine scraper that extracts data from websites, summarizes the extracted information, and converts it into a relevant result for users.
2. The search engine scraper works in three stages: extraction of data from website content, summarization of the extracted data using natural language processing techniques, and conversion of the summarized data into a meaningful format for users.
3. The summarization stage uses natural language toolkit processing libraries to determine sentence similarity, assign weights to sentences, and select sentences with higher ranks to include in the summary.
A Novel Data Extraction and Alignment Method for Web DatabasesIJMER
International Journal of Modern Engineering Research (IJMER) is Peer reviewed, online Journal. It serves as an international archival forum of scholarly research related to engineering and science education.
International Journal of Modern Engineering Research (IJMER) covers all the fields of engineering and science: Electrical Engineering, Mechanical Engineering, Civil Engineering, Chemical Engineering, Computer Engineering, Agricultural Engineering, Aerospace Engineering, Thermodynamics, Structural Engineering, Control Engineering, Robotics, Mechatronics, Fluid Mechanics, Nanotechnology, Simulators, Web-based Learning, Remote Laboratories, Engineering Design Methods, Education Research, Students' Satisfaction and Motivation, Global Projects, and Assessment…. And many more.
The web is probably too large already, and getting larger. Merging small sites is an opportunity to improve navigation efficiency and ongoing content quality - good for site users and site owners.
See related White Paper: http://slidesha.re/japR3W
International Journal of Engineering Research and Development (IJERD)IJERD Editor
call for paper 2012, hard copy of journal, research paper publishing, where to publish research paper,
journal publishing, how to publish research paper, Call For research paper, international journal, publishing a paper, IJERD, journal of science and technology, how to get a research paper published, publishing a paper, publishing of journal, publishing of research paper, reserach and review articles, IJERD Journal, How to publish your research paper, publish research paper, open access engineering journal, Engineering journal, Mathemetics journal, Physics journal, Chemistry journal, Computer Engineering, Computer Science journal, how to submit your paper, peer reviw journal, indexed journal, reserach and review articles, engineering journal, www.ijerd.com, research journals,
yahoo journals, bing journals, International Journal of Engineering Research and Development, google journals, hard copy of journal
RECOMMENDATION FOR WEB SERVICE COMPOSITION BY MINING USAGE LOGSIJDKP
Web service composition has been one of the most researched topics of the past decade. Novel methods of
web service composition are being proposed in the literature include Semantics-based composition, WSDLbased
composition. Although these methods provide promising results for composition, search and
discovery of web service based on QoS parameter of network and semantics or ontology associated with
WSDL, they do not address composition based on usage of web service. Web Service usage logs capture
time series data of web service invocation by business objects, which innately captures patterns or
workflows associated with business operations. Web service composition based on such patterns and
workflows can greatly streamline the business operations. In this research work, we try to explore and
implement methods of mining web service usage logs. Main objectives include Identifying usage
association of services. Linking one service invocation with other, Evaluation of the causal relationship
between associations of services.
This document proposes a new advanced e-commerce recommendation system that incorporates additional data beyond a user's purchase history to provide improved recommendations. Specifically, it suggests combining a user's search and purchase history with data from social media platforms like Twitter, reviews from sites like Amazon and Snapdeal, the user's location and preferences. This additional data could allow the system to recommend items a user has not previously purchased. It also describes some limitations of existing recommendation systems that only consider purchase history or require clustering users. The proposed system would use clustering and collaborative filtering together to group similar products and then make recommendations based on the evaluations within each cluster to improve accuracy.
Web usage Mining Based on Request Dependency GraphIRJET Journal
This document discusses using request dependency graphs (RDGs) to model the dependency relationships between HTTP requests for web usage mining. RDGs can improve data quality and enhance network and web server performance. The authors evaluated their approach using a large real-world web access log and found that RDGs are a useful tool for web usage mining by extracting patterns from user access behaviors and decomposing websites.
Calculating Rank of Web Documents Using Its Content and Link AnalysisIRJET Journal
This document proposes a method for calculating the rank of web documents using both content analysis and link analysis to improve search results. It discusses how current search engines use page ranking based only on hyperlinks. The proposed method first fetches links and content from the top search results for a query. It creates a dictionary to evaluate a score for each document based on how many times the query terms appear. It also considers synonyms from WordNet. The scores are then used to reorder the results to surface the most relevant pages. The algorithm is demonstrated on sample queries. Experimental results show the original and reordered rankings. The goal is to reduce irrelevant results at the top by incorporating both link and content information.
1. The document describes a search engine scraper that extracts data from websites, summarizes the extracted information, and converts it into a relevant result for users.
2. The search engine scraper works in three stages: extraction of data from website content, summarization of the extracted data using natural language processing techniques, and conversion of the summarized data into a meaningful format for users.
3. The summarization stage uses natural language toolkit processing libraries to determine sentence similarity, assign weights to sentences, and select sentences with higher ranks to include in the summary.
A Novel Data Extraction and Alignment Method for Web DatabasesIJMER
International Journal of Modern Engineering Research (IJMER) is Peer reviewed, online Journal. It serves as an international archival forum of scholarly research related to engineering and science education.
International Journal of Modern Engineering Research (IJMER) covers all the fields of engineering and science: Electrical Engineering, Mechanical Engineering, Civil Engineering, Chemical Engineering, Computer Engineering, Agricultural Engineering, Aerospace Engineering, Thermodynamics, Structural Engineering, Control Engineering, Robotics, Mechatronics, Fluid Mechanics, Nanotechnology, Simulators, Web-based Learning, Remote Laboratories, Engineering Design Methods, Education Research, Students' Satisfaction and Motivation, Global Projects, and Assessment…. And many more.
The web is probably too large already, and getting larger. Merging small sites is an opportunity to improve navigation efficiency and ongoing content quality - good for site users and site owners.
See related White Paper: http://slidesha.re/japR3W
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A ReviewIRJET Journal
This document summarizes research on multi-stage smart deep web crawling systems. It discusses challenges in efficiently locating deep web interfaces due to their large numbers and dynamic nature. It proposes a three-stage crawling framework to address these challenges. The first stage performs site-based searching to prioritize relevant sites. The second stage explores sites to efficiently search for forms. An adaptive learning algorithm selects features and constructs link rankers to prioritize relevant links for fast searching. Evaluation on real web data showed the framework achieves substantially higher harvest rates than existing approaches.
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...IRJET Journal
The document proposes a novel methodology for predicting consumer demand and future requests on web pages using a hybrid approach. It first classifies consumers as potential or non-potential using a firefly-based neural network with Levenberg-Marquardt algorithm. Potential consumer data is then clustered using an improved fuzzy C-means clustering algorithm. Finally, upcoming consumer demand is predicted by analyzing patterns and recommending web pages with higher weights. The proposed approach is implemented in Java and CloudSim and aims to overcome limitations of existing recommendation systems by providing more accurate and efficient predictions in shorter time.
I/O-Efficient Techniques for Computing Pagerank : NOTESSubhajit Sahu
Highlighted notes while doing research work under Prof. Dip Sankar Banerjee and Prof. Kishore Kothapalli:
I/O-Efficient Techniques for Computing Pagerank.
https://dl.acm.org/doi/10.1145/584792.584882
This paper describes a technique to partition the link file of the whole file into blocks of a range of destination nodes, with partial source nodes, so that it is possible to run power iteration of pagerank of massive graphs which do not fit in memory. The graphs must be stored on disk, and partitions of the graphs are scanned in every iteration until the ranks converge. Unlike Haveliwala's technique, this is similar to pull based pagerank. Both methods have similarities with join techniques in database systems. Topic-sensitive pagerank is also discussed which finds pagerank of graphs related to a specific keywords beforehand, and merges them together based upon the query (might return better results than global pagerank). This requires small adjustments to the random jump probability factor (1-d).
This document provides an introduction to web analytics. It begins with explaining why web analytics is needed by discussing how offline marketing lacks accountability and measurability. It then defines web analytics as the measurement, collection, analysis and reporting of internet data to understand and optimize web usage. The document outlines different types of web analytics including on-site and off-site. It also discusses the history and context of web analytics within decision support systems and business intelligence. Finally, it covers the main website data collection methods of server log file analysis and page tagging.
This document provides an introduction to web analytics. It discusses why web analytics is needed to measure the success of digital marketing efforts. Web analytics involves measuring, collecting, analyzing and reporting internet data to understand and optimize web usage. There are two main methods for collecting web analytics data: server log file analysis and page tagging. Server log files record information from a website's server, while page tagging involves including tracking code on website pages that collects user interaction data. The document outlines the advantages and considerations of each data collection method.
Providing highly accurate service recommendation for semantic clustering over...IRJET Journal
This document proposes an adaptive recommendation system to provide accurate service recommendations for semantic clustering over big data. It combines item-based collaborative filtering and content-based filtering techniques to address issues like cold starts, sparsity, and scalability. The system first performs clustering to group similar services and reduce data size. It then uses the collaborative filtering approach of finding similar users to the target user and their item ratings to generate recommendations. The content-based filtering technique considers a user's past ratings to recommend related items. Combining these techniques improves accuracy and performance for big data applications. Evaluation of this adaptive recommendation system shows it can provide highly accurate recommendations and address current limitations.
IRJET- Big Data Processes and Analysis using Hadoop FrameworkIRJET Journal
This document discusses issues with analyzing sub-datasets in a distributed manner using Hadoop, such as imbalanced computational loads and inefficient data scanning. It proposes a new approach called Data-Net that uses metadata about sub-dataset distributions stored in an Elastic-Map structure to optimize storage placement and queries. Experimental results on a 128-node cluster show that Data-Net provides better load balancing and performance for various sub-dataset analysis applications compared to the default Hadoop implementation.
Recommender System- Analyzing products by mining Data StreamsIRJET Journal
This document discusses several papers related to recommender systems and analyzing products and reviews. It discusses using data mining techniques like SVM, Naive Bayes and clustering algorithms to build recommendation systems for small businesses based on product sales and reviews. It also discusses detecting fake reviews using language analysis and summarizes papers on using Power BI for data visualization and analyzing research data. Key aspects covered include using data streams to provide recommendations in real-time, detecting fake reviews, using data visualization tools like Power BI for analysis, and combining clustering and association rule mining for recommendations.
Benchmarking Logistics Performance
Logistics professionals believe benchmarking can improve performance,
but there are several dimensions to benchmarking. Understanding these
dimensions is critical to effective benchmarking.
IRJET- Development and Design of Recommendation System for User Interest Shop...IRJET Journal
This document presents a machine learning based recommendation system for recommending products to users based on their interests. It proposes a technique called Fidoop DP that uses Voronoi diagrams to partition user data across nodes in a Hadoop cluster in order to reduce network overhead. The system tracks users' social media activities to identify brands and products they like. These are used to rank and recommend products to users on a shopping site. It was found to significantly reduce loads on Hadoop cluster nodes. The authors believe this approach could be enhanced further using real machine learning algorithms and big data from actual social media and shopping applications.
The document discusses the need for a new web-based financial modeling platform to address risks and issues with traditional spreadsheet-based modeling. Such a platform would provide compliance with regulations, assurance of data integrity, return on investment, and efficient processes. It would formalize modeling artifacts and transactions to speed up analysis, simplify communication, and reduce errors and fraud compared to error-prone spreadsheets. Adopting such a platform could help organizations gain competitive advantages through more accurate and timely insights.
Web Mine Customer Relationship ManagementIRJET Journal
This document discusses using web mining and data analytics techniques to improve customer relationship management (CRM). It begins with an abstract that outlines using data mining to make CRM approaches like segmentation, targeting, retention and loyalty more effective. It then discusses implementing analytical CRM using techniques like data warehousing and data mining. The document also covers using a web crawler to extract customer data from websites to input into CRM systems, and testing the installation of various open-source CRM systems. In conclusion, it finds that most open-source CRM systems could be implemented, with one exception.
WebSite Visit Forecasting Using Data Mining TechniquesChandana Napagoda
Data mining is a technique which is used for identifying relationships between various large amounts of data in many areas including scientific research, business planning, traffic analysis, clinical trial data mining etc. This research will be researching applicability of data mining techniques in web site visit prediction domain. Here we will be concentrating on time series regression techniques which will be used to analyse and forecast time dependent data points. Then how those techniques will be applied to forecast web site visits will be explained.
IRJET- Recommendation System based on Graph Database TechniquesIRJET Journal
This document proposes a recommendation system based on graph database techniques. It uses Neo4j to develop a recommendation approach using content-based filtering, collaborative filtering, and hybrid filtering. The system recommends restaurants and meals to customers based on reviews and friend recommendations. It stores data about restaurants, meals, customers and their reviews in a graph database to allow for complex queries and recommendations. The implementation and results of the proposed recommendation system are also discussed.
Hadoop and Hive Inspecting Maintenance of Mobile Application for Groceries Ex...IIRindia
Numerous movable applications on secure groceries expenditure and e-health have designed recently. Health aware clients respect such applications for secure groceries expenditure, particularly to avoid irritating groceries and added substances. However, there is the lack of a complete database including organized or unstructured information to help such applications. In the paper propose the Multiple Scoring Frameworks (MSF), a healthy groceries expenditure search service for movable applications using Hadoop and MapReduce (MR). The MSF works in a procedure behind a portable application to give a search service for data on groceries and groceries added substances. MSF works with similar logic from a web search engine (WSE) and it crawls over Web sources cataloguing important data for possible utilize in reacting to questions from movable applications. MSF outline and advancement are featured in the paper during its framework design, inquiry understanding, its utilization of the Hadoop/MapReduce infrastructure, and activity contents.
IRJET- Sentimental Analysis of Product Reviews for E-Commerce WebsitesIRJET Journal
This document summarizes a research paper that proposes using sentiment analysis of product reviews on e-commerce websites to help consumers decide where to purchase a product. The researchers describe collecting reviews from multiple websites, preprocessing the text, using clustering and classification algorithms like mean shift and support vector machines to label reviews as positive, negative or neutral. The system would then compare the results across websites and recommend the one with the most positive reviews to reduce the time users spend researching. Future work could include detecting fake reviews and identifying reasons for negative reviews on particular sites.
6 key things UXers need to know while working with APIsMargaret Hanley
This talk is for UX designers working in projects where the site is being fed and created by APis. The six things they need to know as they design their interfaces and workflows with API.
BIG MART SALES PREDICTION USING MACHINE LEARNINGIRJET Journal
This document describes a study that uses machine learning to predict sales at Big Mart stores. The researchers collected data on 8542 products from Kaggle and used the XGBoost regressor model to predict sales. They preprocessed the data by handling missing values, removing unnecessary attributes, data visualization, cleaning, label encoding, and splitting into training and testing sets. The XGBoost model was trained on the preprocessed data and evaluated using metrics like RMSE and R-squared. The model achieved accurate sales predictions that can help Big Mart better plan strategies to increase profits and outcompete rivals.
RESEARCH CHALLENGES IN WEB ANALYTICS – A STUDYIRJET Journal
This document discusses research challenges in web analytics. It begins with an introduction to web analytics and its uses and features such as improving user experience, enhancing e-commerce, and tracking marketing campaigns. Some common web analytics tools are then described such as Google Analytics, Excel, and Tableau. The web analytics process involves defining goals, collecting and analyzing data, reporting results, developing online strategies, and testing improvements. Key research challenges discussed include data privacy issues, evaluating e-commerce sites using tools like Google Analytics, and using analytics to improve access to archival resources online.
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
1) The document discusses the Sungal Tunnel project in Jammu and Kashmir, India, which is being constructed using the New Austrian Tunneling Method (NATM).
2) NATM involves continuous monitoring during construction to adapt to changing ground conditions, and makes extensive use of shotcrete for temporary tunnel support.
3) The methodology section outlines the systematic geotechnical design process for tunnels according to Austrian guidelines, and describes the various steps of NATM tunnel construction including initial and secondary tunnel support.
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
This study examines the effect of response reduction factors (R factors) on reinforced concrete (RC) framed structures through nonlinear dynamic analysis. Three RC frame models with varying heights (4, 8, and 12 stories) were analyzed in ETABS software under different R factors ranging from 1 to 5. The results showed that displacement increased as the R factor decreased, indicating less linear behavior for lower R factors. Drift also decreased proportionally with increasing R factors from 1 to 5. Shear forces in the frames decreased with higher R factors. In general, R factors of 3 to 5 produced more satisfactory performance with less displacement and drift. The displacement variations between different building heights were consistent at different R factors. This study evaluated how R factors influence
More Related Content
Similar to IRJET- Multi-Faceted Approach to Automated Classification of Business Web Pages
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A ReviewIRJET Journal
This document summarizes research on multi-stage smart deep web crawling systems. It discusses challenges in efficiently locating deep web interfaces due to their large numbers and dynamic nature. It proposes a three-stage crawling framework to address these challenges. The first stage performs site-based searching to prioritize relevant sites. The second stage explores sites to efficiently search for forms. An adaptive learning algorithm selects features and constructs link rankers to prioritize relevant links for fast searching. Evaluation on real web data showed the framework achieves substantially higher harvest rates than existing approaches.
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...IRJET Journal
The document proposes a novel methodology for predicting consumer demand and future requests on web pages using a hybrid approach. It first classifies consumers as potential or non-potential using a firefly-based neural network with Levenberg-Marquardt algorithm. Potential consumer data is then clustered using an improved fuzzy C-means clustering algorithm. Finally, upcoming consumer demand is predicted by analyzing patterns and recommending web pages with higher weights. The proposed approach is implemented in Java and CloudSim and aims to overcome limitations of existing recommendation systems by providing more accurate and efficient predictions in shorter time.
I/O-Efficient Techniques for Computing Pagerank : NOTESSubhajit Sahu
Highlighted notes while doing research work under Prof. Dip Sankar Banerjee and Prof. Kishore Kothapalli:
I/O-Efficient Techniques for Computing Pagerank.
https://dl.acm.org/doi/10.1145/584792.584882
This paper describes a technique to partition the link file of the whole file into blocks of a range of destination nodes, with partial source nodes, so that it is possible to run power iteration of pagerank of massive graphs which do not fit in memory. The graphs must be stored on disk, and partitions of the graphs are scanned in every iteration until the ranks converge. Unlike Haveliwala's technique, this is similar to pull based pagerank. Both methods have similarities with join techniques in database systems. Topic-sensitive pagerank is also discussed which finds pagerank of graphs related to a specific keywords beforehand, and merges them together based upon the query (might return better results than global pagerank). This requires small adjustments to the random jump probability factor (1-d).
This document provides an introduction to web analytics. It begins with explaining why web analytics is needed by discussing how offline marketing lacks accountability and measurability. It then defines web analytics as the measurement, collection, analysis and reporting of internet data to understand and optimize web usage. The document outlines different types of web analytics including on-site and off-site. It also discusses the history and context of web analytics within decision support systems and business intelligence. Finally, it covers the main website data collection methods of server log file analysis and page tagging.
This document provides an introduction to web analytics. It discusses why web analytics is needed to measure the success of digital marketing efforts. Web analytics involves measuring, collecting, analyzing and reporting internet data to understand and optimize web usage. There are two main methods for collecting web analytics data: server log file analysis and page tagging. Server log files record information from a website's server, while page tagging involves including tracking code on website pages that collects user interaction data. The document outlines the advantages and considerations of each data collection method.
Providing highly accurate service recommendation for semantic clustering over...IRJET Journal
This document proposes an adaptive recommendation system to provide accurate service recommendations for semantic clustering over big data. It combines item-based collaborative filtering and content-based filtering techniques to address issues like cold starts, sparsity, and scalability. The system first performs clustering to group similar services and reduce data size. It then uses the collaborative filtering approach of finding similar users to the target user and their item ratings to generate recommendations. The content-based filtering technique considers a user's past ratings to recommend related items. Combining these techniques improves accuracy and performance for big data applications. Evaluation of this adaptive recommendation system shows it can provide highly accurate recommendations and address current limitations.
IRJET- Big Data Processes and Analysis using Hadoop FrameworkIRJET Journal
This document discusses issues with analyzing sub-datasets in a distributed manner using Hadoop, such as imbalanced computational loads and inefficient data scanning. It proposes a new approach called Data-Net that uses metadata about sub-dataset distributions stored in an Elastic-Map structure to optimize storage placement and queries. Experimental results on a 128-node cluster show that Data-Net provides better load balancing and performance for various sub-dataset analysis applications compared to the default Hadoop implementation.
Recommender System- Analyzing products by mining Data StreamsIRJET Journal
This document discusses several papers related to recommender systems and analyzing products and reviews. It discusses using data mining techniques like SVM, Naive Bayes and clustering algorithms to build recommendation systems for small businesses based on product sales and reviews. It also discusses detecting fake reviews using language analysis and summarizes papers on using Power BI for data visualization and analyzing research data. Key aspects covered include using data streams to provide recommendations in real-time, detecting fake reviews, using data visualization tools like Power BI for analysis, and combining clustering and association rule mining for recommendations.
Benchmarking Logistics Performance
Logistics professionals believe benchmarking can improve performance,
but there are several dimensions to benchmarking. Understanding these
dimensions is critical to effective benchmarking.
IRJET- Development and Design of Recommendation System for User Interest Shop...IRJET Journal
This document presents a machine learning based recommendation system for recommending products to users based on their interests. It proposes a technique called Fidoop DP that uses Voronoi diagrams to partition user data across nodes in a Hadoop cluster in order to reduce network overhead. The system tracks users' social media activities to identify brands and products they like. These are used to rank and recommend products to users on a shopping site. It was found to significantly reduce loads on Hadoop cluster nodes. The authors believe this approach could be enhanced further using real machine learning algorithms and big data from actual social media and shopping applications.
The document discusses the need for a new web-based financial modeling platform to address risks and issues with traditional spreadsheet-based modeling. Such a platform would provide compliance with regulations, assurance of data integrity, return on investment, and efficient processes. It would formalize modeling artifacts and transactions to speed up analysis, simplify communication, and reduce errors and fraud compared to error-prone spreadsheets. Adopting such a platform could help organizations gain competitive advantages through more accurate and timely insights.
Web Mine Customer Relationship ManagementIRJET Journal
This document discusses using web mining and data analytics techniques to improve customer relationship management (CRM). It begins with an abstract that outlines using data mining to make CRM approaches like segmentation, targeting, retention and loyalty more effective. It then discusses implementing analytical CRM using techniques like data warehousing and data mining. The document also covers using a web crawler to extract customer data from websites to input into CRM systems, and testing the installation of various open-source CRM systems. In conclusion, it finds that most open-source CRM systems could be implemented, with one exception.
WebSite Visit Forecasting Using Data Mining TechniquesChandana Napagoda
Data mining is a technique which is used for identifying relationships between various large amounts of data in many areas including scientific research, business planning, traffic analysis, clinical trial data mining etc. This research will be researching applicability of data mining techniques in web site visit prediction domain. Here we will be concentrating on time series regression techniques which will be used to analyse and forecast time dependent data points. Then how those techniques will be applied to forecast web site visits will be explained.
IRJET- Recommendation System based on Graph Database TechniquesIRJET Journal
This document proposes a recommendation system based on graph database techniques. It uses Neo4j to develop a recommendation approach using content-based filtering, collaborative filtering, and hybrid filtering. The system recommends restaurants and meals to customers based on reviews and friend recommendations. It stores data about restaurants, meals, customers and their reviews in a graph database to allow for complex queries and recommendations. The implementation and results of the proposed recommendation system are also discussed.
Hadoop and Hive Inspecting Maintenance of Mobile Application for Groceries Ex...IIRindia
Numerous movable applications on secure groceries expenditure and e-health have designed recently. Health aware clients respect such applications for secure groceries expenditure, particularly to avoid irritating groceries and added substances. However, there is the lack of a complete database including organized or unstructured information to help such applications. In the paper propose the Multiple Scoring Frameworks (MSF), a healthy groceries expenditure search service for movable applications using Hadoop and MapReduce (MR). The MSF works in a procedure behind a portable application to give a search service for data on groceries and groceries added substances. MSF works with similar logic from a web search engine (WSE) and it crawls over Web sources cataloguing important data for possible utilize in reacting to questions from movable applications. MSF outline and advancement are featured in the paper during its framework design, inquiry understanding, its utilization of the Hadoop/MapReduce infrastructure, and activity contents.
IRJET- Sentimental Analysis of Product Reviews for E-Commerce WebsitesIRJET Journal
This document summarizes a research paper that proposes using sentiment analysis of product reviews on e-commerce websites to help consumers decide where to purchase a product. The researchers describe collecting reviews from multiple websites, preprocessing the text, using clustering and classification algorithms like mean shift and support vector machines to label reviews as positive, negative or neutral. The system would then compare the results across websites and recommend the one with the most positive reviews to reduce the time users spend researching. Future work could include detecting fake reviews and identifying reasons for negative reviews on particular sites.
6 key things UXers need to know while working with APIsMargaret Hanley
This talk is for UX designers working in projects where the site is being fed and created by APis. The six things they need to know as they design their interfaces and workflows with API.
BIG MART SALES PREDICTION USING MACHINE LEARNINGIRJET Journal
This document describes a study that uses machine learning to predict sales at Big Mart stores. The researchers collected data on 8542 products from Kaggle and used the XGBoost regressor model to predict sales. They preprocessed the data by handling missing values, removing unnecessary attributes, data visualization, cleaning, label encoding, and splitting into training and testing sets. The XGBoost model was trained on the preprocessed data and evaluated using metrics like RMSE and R-squared. The model achieved accurate sales predictions that can help Big Mart better plan strategies to increase profits and outcompete rivals.
RESEARCH CHALLENGES IN WEB ANALYTICS – A STUDYIRJET Journal
This document discusses research challenges in web analytics. It begins with an introduction to web analytics and its uses and features such as improving user experience, enhancing e-commerce, and tracking marketing campaigns. Some common web analytics tools are then described such as Google Analytics, Excel, and Tableau. The web analytics process involves defining goals, collecting and analyzing data, reporting results, developing online strategies, and testing improvements. Key research challenges discussed include data privacy issues, evaluating e-commerce sites using tools like Google Analytics, and using analytics to improve access to archival resources online.
Similar to IRJET- Multi-Faceted Approach to Automated Classification of Business Web Pages (20)
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
1) The document discusses the Sungal Tunnel project in Jammu and Kashmir, India, which is being constructed using the New Austrian Tunneling Method (NATM).
2) NATM involves continuous monitoring during construction to adapt to changing ground conditions, and makes extensive use of shotcrete for temporary tunnel support.
3) The methodology section outlines the systematic geotechnical design process for tunnels according to Austrian guidelines, and describes the various steps of NATM tunnel construction including initial and secondary tunnel support.
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
This study examines the effect of response reduction factors (R factors) on reinforced concrete (RC) framed structures through nonlinear dynamic analysis. Three RC frame models with varying heights (4, 8, and 12 stories) were analyzed in ETABS software under different R factors ranging from 1 to 5. The results showed that displacement increased as the R factor decreased, indicating less linear behavior for lower R factors. Drift also decreased proportionally with increasing R factors from 1 to 5. Shear forces in the frames decreased with higher R factors. In general, R factors of 3 to 5 produced more satisfactory performance with less displacement and drift. The displacement variations between different building heights were consistent at different R factors. This study evaluated how R factors influence
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
This study compares the use of Stark Steel and TMT Steel as reinforcement materials in a two-way reinforced concrete slab. Mechanical testing is conducted to determine the tensile strength, yield strength, and other properties of each material. A two-way slab design adhering to codes and standards is executed with both materials. The performance is analyzed in terms of deflection, stability under loads, and displacement. Cost analyses accounting for material, durability, maintenance, and life cycle costs are also conducted. The findings provide insights into the economic and structural implications of each material for reinforcement selection and recommendations on the most suitable material based on the analysis.
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
This document discusses a study analyzing the effect of camber, position of camber, and angle of attack on the aerodynamic characteristics of airfoils. Sixteen modified asymmetric NACA airfoils were analyzed using computational fluid dynamics (CFD) by varying the camber, camber position, and angle of attack. The results showed the relationship between these parameters and the lift coefficient, drag coefficient, and lift to drag ratio. This provides insight into how changes in airfoil geometry impact aerodynamic performance.
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
This document reviews the progress and challenges of aluminum-based metal matrix composites (MMCs), focusing on their fabrication processes and applications. It discusses how various aluminum MMCs have been developed using reinforcements like borides, carbides, oxides, and nitrides to improve mechanical and wear properties. These composites have gained prominence for their lightweight, high-strength and corrosion resistance properties. The document also examines recent advancements in fabrication techniques for aluminum MMCs and their growing applications in industries such as aerospace and automotive. However, it notes that challenges remain around issues like improper mixing of reinforcements and reducing reinforcement agglomeration.
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
This document discusses research on using graph neural networks (GNNs) for dynamic optimization of public transportation networks in real-time. GNNs represent transit networks as graphs with nodes as stops and edges as connections. The GNN model aims to optimize networks using real-time data on vehicle locations, arrival times, and passenger loads. This helps increase mobility, decrease traffic, and improve efficiency. The system continuously trains and infers to adapt to changing transit conditions, providing decision support tools. While research has focused on performance, more work is needed on security, socio-economic impacts, contextual generalization of models, continuous learning approaches, and effective real-time visualization.
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
This document summarizes a research project that aims to compare the structural performance of conventional slab and grid slab systems in multi-story buildings using ETABS software. The study will analyze both symmetric and asymmetric building models under various loading conditions. Parameters like deflections, moments, shears, and stresses will be examined to evaluate the structural effectiveness of each slab type. The results will provide insights into the comparative behavior of conventional and grid slabs to help engineers and architects select appropriate slab systems based on building layouts and design requirements.
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
This document summarizes and reviews a research paper on the seismic response of reinforced concrete (RC) structures with plan and vertical irregularities, with and without infill walls. It discusses how infill walls can improve or reduce the seismic performance of RC buildings, depending on factors like wall layout, height distribution, connection to the frame, and relative stiffness of walls and frames. The reviewed research paper analyzes the behavior of infill walls, effects of vertical irregularities, and seismic performance of high-rise structures under linear static and dynamic analysis. It studies response characteristics like story drift, deflection and shear. The document also provides literature on similar research investigating the effects of infill walls, soft stories, plan irregularities, and different
This document provides a review of machine learning techniques used in Advanced Driver Assistance Systems (ADAS). It begins with an abstract that summarizes key applications of machine learning in ADAS, including object detection, recognition, and decision-making. The introduction discusses the integration of machine learning in ADAS and how it is transforming vehicle safety. The literature review then examines several research papers on topics like lightweight deep learning models for object detection and lane detection models using image processing. It concludes by discussing challenges and opportunities in the field, such as improving algorithm robustness and adaptability.
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
The document analyzes temperature and precipitation trends in Asosa District, Benishangul Gumuz Region, Ethiopia from 1993 to 2022 based on data from the local meteorological station. The results show:
1) The average maximum and minimum annual temperatures have generally decreased over time, with maximum temperatures decreasing by a factor of -0.0341 and minimum by -0.0152.
2) Mann-Kendall tests found the decreasing temperature trends to be statistically significant for annual maximum temperatures but not for annual minimum temperatures.
3) Annual precipitation in Asosa District showed a statistically significant increasing trend.
The conclusions recommend development planners account for rising summer precipitation and declining temperatures in
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
This document discusses the design and analysis of pre-engineered building (PEB) framed structures using STAAD Pro software. It provides an overview of PEBs, including that they are designed off-site with building trusses and beams produced in a factory. STAAD Pro is identified as a key tool for modeling, analyzing, and designing PEBs to ensure their performance and safety under various load scenarios. The document outlines modeling structural parts in STAAD Pro, evaluating structural reactions, assigning loads, and following international design codes and standards. In summary, STAAD Pro is used to design and analyze PEB framed structures to ensure safety and code compliance.
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
This document provides a review of research on innovative fiber integration methods for reinforcing concrete structures. It discusses studies that have explored using carbon fiber reinforced polymer (CFRP) composites with recycled plastic aggregates to develop more sustainable strengthening techniques. It also examines using ultra-high performance fiber reinforced concrete to improve shear strength in beams. Additional topics covered include the dynamic responses of FRP-strengthened beams under static and impact loads, and the performance of preloaded CFRP-strengthened fiber reinforced concrete beams. The review highlights the potential of fiber composites to enable more sustainable and resilient construction practices.
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
This document summarizes a survey on securing patient healthcare data in cloud-based systems. It discusses using technologies like facial recognition, smart cards, and cloud computing combined with strong encryption to securely store patient data. The survey found that healthcare professionals believe digitizing patient records and storing them in a centralized cloud system would improve access during emergencies and enable more efficient care compared to paper-based systems. However, ensuring privacy and security of patient data is paramount as healthcare incorporates these digital technologies.
Review on studies and research on widening of existing concrete bridgesIRJET Journal
This document summarizes several studies that have been conducted on widening existing concrete bridges. It describes a study from China that examined load distribution factors for a bridge widened with composite steel-concrete girders. It also outlines challenges and solutions for widening a bridge in the UAE, including replacing bearings and stitching the new and existing structures. Additionally, it discusses two bridge widening projects in New Zealand that involved adding precast beams and stitching to connect structures. Finally, safety measures and challenges for strengthening a historic bridge in Switzerland under live traffic are presented.
React based fullstack edtech web applicationIRJET Journal
The document describes the architecture of an educational technology web application built using the MERN stack. It discusses the frontend developed with ReactJS, backend with NodeJS and ExpressJS, and MongoDB database. The frontend provides dynamic user interfaces, while the backend offers APIs for authentication, course management, and other functions. MongoDB enables flexible data storage. The architecture aims to provide a scalable, responsive platform for online learning.
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
This paper proposes integrating Internet of Things (IoT) and blockchain technologies to help implement objectives of India's National Education Policy (NEP) in the education sector. The paper discusses how blockchain could be used for secure student data management, credential verification, and decentralized learning platforms. IoT devices could create smart classrooms, automate attendance tracking, and enable real-time monitoring. Blockchain would ensure integrity of exam processes and resource allocation, while smart contracts automate agreements. The paper argues this integration has potential to revolutionize education by making it more secure, transparent and efficient, in alignment with NEP goals. However, challenges like infrastructure needs, data privacy, and collaborative efforts are also discussed.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
This document provides a review of research on the performance of coconut fibre reinforced concrete. It summarizes several studies that tested different volume fractions and lengths of coconut fibres in concrete mixtures with varying compressive strengths. The studies found that coconut fibre improved properties like tensile strength, toughness, crack resistance, and spalling resistance compared to plain concrete. Volume fractions of 2-5% and fibre lengths of 20-50mm produced the best results. The document concludes that using a 4-5% volume fraction of coconut fibres 30-40mm in length with M30-M60 grade concrete would provide benefits based on previous research.
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
The document discusses optimizing business management processes through automation using Microsoft Power Automate and artificial intelligence. It provides an overview of Power Automate's key components and features for automating workflows across various apps and services. The document then presents several scenarios applying automation solutions to common business processes like data entry, monitoring, HR, finance, customer support, and more. It estimates the potential time and cost savings from implementing automation for each scenario. Finally, the conclusion emphasizes the transformative impact of AI and automation tools on business processes and the need for ongoing optimization.
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
The document describes the seismic design of a G+5 steel building frame located in Roorkee, India according to Indian codes IS 1893-2002 and IS 800. The frame was analyzed using the equivalent static load method and response spectrum method, and its response in terms of displacements and shear forces were compared. Based on the analysis, the frame was designed as a seismic-resistant steel structure according to IS 800:2007. The software STAAD Pro was used for the analysis and design.
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
This research paper explores using plastic waste as a sustainable and cost-effective construction material. The study focuses on manufacturing pavers and bricks using recycled plastic and partially replacing concrete with plastic alternatives. Initial results found that pavers and bricks made from recycled plastic demonstrate comparable strength and durability to traditional materials while providing environmental and cost benefits. Additionally, preliminary research indicates incorporating plastic waste as a partial concrete replacement significantly reduces construction costs without compromising structural integrity. The outcomes suggest adopting plastic waste in construction can address plastic pollution while optimizing costs, promoting more sustainable building practices.
Advanced control scheme of doubly fed induction generator for wind turbine us...IJECEIAES
This paper describes a speed control device for generating electrical energy on an electricity network based on the doubly fed induction generator (DFIG) used for wind power conversion systems. At first, a double-fed induction generator model was constructed. A control law is formulated to govern the flow of energy between the stator of a DFIG and the energy network using three types of controllers: proportional integral (PI), sliding mode controller (SMC) and second order sliding mode controller (SOSMC). Their different results in terms of power reference tracking, reaction to unexpected speed fluctuations, sensitivity to perturbations, and resilience against machine parameter alterations are compared. MATLAB/Simulink was used to conduct the simulations for the preceding study. Multiple simulations have shown very satisfying results, and the investigations demonstrate the efficacy and power-enhancing capabilities of the suggested control system.
We have compiled the most important slides from each speaker's presentation. This year’s compilation, available for free, captures the key insights and contributions shared during the DfMAy 2024 conference.
Introduction- e - waste – definition - sources of e-waste– hazardous substances in e-waste - effects of e-waste on environment and human health- need for e-waste management– e-waste handling rules - waste minimization techniques for managing e-waste – recycling of e-waste - disposal treatment methods of e- waste – mechanism of extraction of precious metal from leaching solution-global Scenario of E-waste – E-waste in India- case studies.
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELgerogepatton
As digital technology becomes more deeply embedded in power systems, protecting the communication
networks of Smart Grids (SG) has emerged as a critical concern. Distributed Network Protocol 3 (DNP3)
represents a multi-tiered application layer protocol extensively utilized in Supervisory Control and Data
Acquisition (SCADA)-based smart grids to facilitate real-time data gathering and control functionalities.
Robust Intrusion Detection Systems (IDS) are necessary for early threat detection and mitigation because
of the interconnection of these networks, which makes them vulnerable to a variety of cyberattacks. To
solve this issue, this paper develops a hybrid Deep Learning (DL) model specifically designed for intrusion
detection in smart grids. The proposed approach is a combination of the Convolutional Neural Network
(CNN) and the Long-Short-Term Memory algorithms (LSTM). We employed a recent intrusion detection
dataset (DNP3), which focuses on unauthorized commands and Denial of Service (DoS) cyberattacks, to
train and test our model. The results of our experiments show that our CNN-LSTM method is much better
at finding smart grid intrusions than other deep learning algorithms used for classification. In addition,
our proposed approach improves accuracy, precision, recall, and F1 score, achieving a high detection
accuracy rate of 99.50%.
Harnessing WebAssembly for Real-time Stateless Streaming PipelinesChristina Lin
Traditionally, dealing with real-time data pipelines has involved significant overhead, even for straightforward tasks like data transformation or masking. However, in this talk, we’ll venture into the dynamic realm of WebAssembly (WASM) and discover how it can revolutionize the creation of stateless streaming pipelines within a Kafka (Redpanda) broker. These pipelines are adept at managing low-latency, high-data-volume scenarios.
A review on techniques and modelling methodologies used for checking electrom...nooriasukmaningtyas
The proper function of the integrated circuit (IC) in an inhibiting electromagnetic environment has always been a serious concern throughout the decades of revolution in the world of electronics, from disjunct devices to today’s integrated circuit technology, where billions of transistors are combined on a single chip. The automotive industry and smart vehicles in particular, are confronting design issues such as being prone to electromagnetic interference (EMI). Electronic control devices calculate incorrect outputs because of EMI and sensors give misleading values which can prove fatal in case of automotives. In this paper, the authors have non exhaustively tried to review research work concerned with the investigation of EMI in ICs and prediction of this EMI using various modelling methodologies and measurement setups.
Literature Review Basics and Understanding Reference Management.pptxDr Ramhari Poudyal
Three-day training on academic research focuses on analytical tools at United Technical College, supported by the University Grant Commission, Nepal. 24-26 May 2024