International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 04 Issue: 08 | Aug -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 1149
Knowledge Discovery of Small Business Domain Using Web Crawling
and Data Mining
Latha M1, Shivanand R D2
1M.Tech. Department of Computer Science and Engineering, Bapuji Institute of Technology, Davanagere,
Karnataka, India.
2Associate Professor, Department of Computer Science and Engineering, Bapuji Institute of Technology,
Davanagere, Karnataka, India.
---------------------------------------------------------------------***-------------------------------------------------------------
Abstract - Nowadays almost all information is available on
the web, yet there are many obstacles to using the web
effectively. Because of the sheer volume of information,
users often do not find the information that matches their
requirements. Among businesses, many small businesses
generally go unnoticed by the public. The static features of
a small business are its name, area, contact details and
website address; these are easily obtained. The dynamic
features of a business include its reputation, which is very
difficult to obtain for a small business because there are
few means of encouraging users to write remarks about its
services. This project focuses on developing a knowledge
base of small businesses that gives users valuable
information about each business. Both static and dynamic
data are considered in building the knowledge base. Web
crawling is applied to the static data, and Twitter is used as
the dynamic data source. Users post tweets about the
business, and the posted tweets are analyzed to determine
whether they contain positive or negative responses about
the business.
Key Words: Small business, data analysis, web crawling,
data mining, knowledge base, knowledge discovery
1. INTRODUCTION
Currently, a great deal of information on any subject is
available on the web, but this information needs to be sorted
out. Big data [1] is characterized by three main features:
volume, velocity and variety. Because of these features, it is
hard to use the web as a source in a well-organized manner,
and most users do not get the information they need from it.
Moreover, most web content is written in HTML, the standard
markup language of the web. HTML is intended for web
programs to parse so that they can render graphical layouts.
These layouts mostly contain text, links and images, and
sometimes other media content. Since the main purpose of
HTML is to display data on screen, it is difficult for computer
programs to retrieve useful data from HTML content without
additional language processing.
Against this background, information on the web is
growing quickly. The performance of a small business
improves when the business is run efficiently, and developing
a well-organized small business requires good knowledge.
Small businesses [2] are privately or independently
owned organizations. A small business is limited in size and
has only a few employees. Its annual returns are lower than
those of other businesses. One advantage of a small business
is that a small amount of capital is enough to start it.
Independence is another advantage: small business owners
are able to manage the business by themselves.
Small business activities are mainly directed at
meeting user requirements. Organizations that embrace this
marketing concept are more likely to succeed when it is
combined with growth and differentiation strategies. The
market-oriented strategic stances of successful small
businesses are guided by customer-driven everyday
activities. Small business success depends mainly on how
well customer focus and market positioning are handled. If a
small business handles these two aspects well, it is likely to
succeed both now and in the future.
With respect to the above, this work develops a
knowledge base [3] of small businesses and delivers
information about each business to users. Nowadays many
businesses are developing rapidly, but small businesses are
not easily noticed by people. The static data of a small
business, once stored, does not change over time. It consists
of the business name, area, contact details and website
address, and it is easily available to the public. Dynamic
data, in contrast, changes over time. The dynamic features of
a business are user opinions about its services or
administration. These features are very difficult to retrieve
because there is a lack of methods that encourage users to
write remarks about the services. Examples of small
businesses are hotels, stores, restaurants, bakeries and
shops. In this project, a restaurant is taken as the small
business example.
Food and beverages are primary human needs, and
restaurants provide them. For this reason the restaurant
business is a long-lasting one that is unlikely to lose its
attraction for customers. If the owner has the ability to
manage and run the restaurant well, it can grow quickly.
Some people think that good food and a pleasant atmosphere
are enough to make a restaurant successful, but many
factors besides the taste of the food contribute to a
restaurant's success.
Because the reputation of a small business is very
difficult to retrieve, social media is used as the source of
reputation data. Nowadays people commonly post status
updates about their daily life on social media, usually in the
form of comments. These comments mention where they
spent their time, in which hotel they ate, how the food tasted
and what the environment of the hotel was like. Such status
updates usually express positive or negative opinions about
the service received from a particular organization. The
comments users post on social media are written in natural
language and do not carry any rating, so data mining
techniques are required to increase the accuracy of the
information.
This project focuses on knowledge discovery [4] for
small businesses. First, the static data of the business,
namely its website address, is considered, with Twitter [5]
as the dynamic data source. Web crawling is applied to the
static data, and information is then extracted from the
corresponding web page. Users are then allowed to post
comments about the particular business, and the posted
comments are analyzed to determine whether the tweets [6]
contain positive or negative responses about the business. If
most of the comments are positive, the system reports the
total number of positive tweets about the business, together
with the accuracy. If most of the comments are negative, it
reports the total number of negative tweets, together with
the accuracy. Some kind of data analysis [7] is therefore
needed to determine how positive or negative each tweet
about the business is.
1.1 Web Crawling
Web crawling [7] is the process by which search engines
comb through web pages in order to index them properly.
Web crawling makes it easier for a search engine to return
the most relevant results to users after they enter a search
query.
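
The crawling process described above can be illustrated with a minimal sketch. The paper does not give its crawler code, so everything here is an assumption made for illustration: the names LinkParser, crawl and PAGES are hypothetical, the traversal is a plain breadth-first search, and an in-memory dictionary stands in for real HTTP requests so that the example runs without network access.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed_url, fetch, max_pages=10):
    """Breadth-first crawl starting at seed_url.

    fetch is any callable that maps a URL to its HTML text, or to None
    when the page cannot be retrieved. Returns a dict of URL -> HTML.
    """
    pages = {}
    seen = {seed_url}
    queue = deque([seed_url])
    while queue and len(pages) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        pages[url] = html
        parser = LinkParser()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop "#fragment" parts.
            absolute = urldefrag(urljoin(url, href)).url
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return pages


# Hypothetical in-memory site used instead of real HTTP requests.
PAGES = {
    "http://example.test/": '<a href="/menu.html">Menu</a> <a href="/contact.html">Contact</a>',
    "http://example.test/menu.html": '<a href="/">Home</a>',
    "http://example.test/contact.html": "<p>Phone: 000</p>",
}

crawled = crawl("http://example.test/", PAGES.get)
print(sorted(crawled))
```

In a real setting the fetch argument would be replaced by a function that downloads the page, and the crawler would also need to respect robots.txt and rate limits, which this sketch leaves out.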
1.2 Data Mining
Data mining [8] is the automatic or semiautomatic
analysis of large quantities of data to extract previously
unknown, interesting patterns such as groups of data
records, unusual records and dependencies.
2. SYSTEM ARCHITECTURE
Fig -1: Knowledge Discovery Process
The overall knowledge discovery process is shown in
Figure 1. It involves web crawling on the static data source;
images and information extracted from the retrieved page
are stored in the knowledge base. Twitter serves as the
dynamic data source, and the tweets posted by users are
analyzed, with the accuracy reported.
3. RELATED WORK
The following sections describe the related work for this
paper. For knowledge discovery of small businesses, both
static and dynamic data are considered.
3.1 Web Crawling On Static Data
The initial stage involves the collection of static data.
Static data is defined as data that is not produced by users
and is likely to remain unchanged for a long time. It includes
business names, locations, contacts and types of business. In
this phase, web crawling is applied to the static data of the
business to produce the knowledge base of small businesses.
Images and information extracted from the crawled page are
stored in the knowledge base. The website address of
Apoorva Resorts, located in Davanagere, is
http://www.apoorvaresortsdavangere.in. The following
figure shows web crawling on static data.
Fig -2: Web Crawling Process
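
As a rough illustration of how information from a crawled page might be stored in the knowledge base, the sketch below pulls the page title and visible text out of an HTML string and saves them as a record keyed by website address. The record fields, the names PageInfoParser and build_record, and the sample HTML are all assumptions made for this example; the paper does not describe the actual structure of its knowledge base.

```python
from html.parser import HTMLParser


class PageInfoParser(HTMLParser):
    """Collects the <title> text and the visible text of a page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.text_parts = []
        self._in_title = False
        self._skip_depth = 0  # greater than zero inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if not text:
            return
        if self._in_title:
            self.title += text
        elif self._skip_depth == 0:
            self.text_parts.append(text)


def build_record(url, html):
    """Turn one crawled page into a knowledge base record (hypothetical layout)."""
    parser = PageInfoParser()
    parser.feed(html)
    return {
        "website": url,
        "name": parser.title,
        "text": " ".join(parser.text_parts),
    }


# Made-up page content, used only to demonstrate the extraction step.
sample_html = """<html><head><title>Sample Restaurant</title>
<style>body { color: black; }</style></head>
<body><h1>Welcome</h1><p>Open daily, 9 am to 10 pm.</p></body></html>"""

knowledge_base = {}
record = build_record("http://example.test/", sample_html)
knowledge_base[record["website"]] = record
print(record)
```

Keying the records by website address is one simple choice; a real knowledge base would more likely use a database table with separate columns for name, area and contact.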
3.2 Tweets Analysis
In this phase, a user Twitter account is first created. A
Twitter application is then created in order to generate the
Twitter API keys, access token and secret keys. Creating a
Twitter application is mandatory for accessing Twitter
programmatically. It involves the following steps.
1. Visit the website https://dev.twitter.com/apps/new, log in
to the Twitter account and click Create New App. This is
shown in Figure 3 and Figure 4.
Fig-3: Home Page of Twitter Developer Platform
Fig-4: Login Page of Twitter
2. On the Create an application page, enter the requested
information for the new Twitter application: the application
name, description and user website address. Choose an
application name that has not already been taken by another
user. This is shown in Figure 5.
3. The Twitter application is created. On the main page, click
Keys and Access Tokens. The Application Settings section
contains the Consumer Key and Consumer Secret. This is
shown in Figure 6.
4. Under the Your Access Token section, click Create my
access token. The access tokens are generated and ready to
be used. It is shown in the following figure 7.
Fig-5: Creation of Twitter Application Details
Fig-6: Consumer Key and Consumer Secret
Fig-7: Access Token
5. Twitter authentication information includes the following
elements: Consumer Key, Consumer Secret, Access Token
and Access Token Secret. All of these are needed to access
Twitter programmatically.
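
The four authentication elements listed in step 5 are usually kept out of the source code. The sketch below shows one possible way to collect them from environment variables. It makes no Twitter API calls, and the environment variable names, the TwitterCredentials class and the load_credentials function are hypothetical names chosen for this example; they do not come from the paper or from any Twitter library.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class TwitterCredentials:
    """Holds the four values generated in the steps above."""

    consumer_key: str
    consumer_secret: str
    access_token: str
    access_token_secret: str


# Hypothetical environment variable names, chosen for this sketch only.
ENV_NAMES = {
    "consumer_key": "TWITTER_CONSUMER_KEY",
    "consumer_secret": "TWITTER_CONSUMER_SECRET",
    "access_token": "TWITTER_ACCESS_TOKEN",
    "access_token_secret": "TWITTER_ACCESS_TOKEN_SECRET",
}


def load_credentials(env=os.environ):
    """Read all four values from env and fail early if any is missing."""
    missing = [name for name in ENV_NAMES.values() if not env.get(name)]
    if missing:
        raise KeyError("missing credentials: " + ", ".join(missing))
    return TwitterCredentials(
        **{field: env[name] for field, name in ENV_NAMES.items()}
    )


# Demonstration with placeholder values; real keys should never be hard-coded.
demo_env = {name: "placeholder-" + field for field, name in ENV_NAMES.items()}
creds = load_credentials(demo_env)
print(creds.consumer_key)
```

The resulting object would then be handed to whichever Twitter client library is used for posting and loading tweets.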
Finally, for tweet analysis, the user's tweets are
posted and loaded. The tweets are then classified as either
positive or negative, and the accuracy is reported. The entire
tweet analysis is shown in Figure 8.
Fig-8: Tweets Analysis
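
The paper does not state which technique is used to decide whether a tweet is positive or negative. One simple option is a word-list (lexicon) approach, sketched below purely as an assumption. The word lists are tiny illustrative samples, the function name classify_tweet is hypothetical, and the "neutral" label for ties is an addition of this sketch rather than part of the two-class output described in the paper.

```python
import re

# Tiny illustrative word lists; a usable lexicon would be far larger.
POSITIVE_WORDS = {"good", "great", "tasty", "delicious", "friendly",
                  "clean", "excellent", "love"}
NEGATIVE_WORDS = {"bad", "slow", "rude", "dirty", "cold",
                  "terrible", "poor", "hate"}


def classify_tweet(text):
    """Label a tweet by counting positive and negative lexicon words."""
    words = re.findall(r"[a-z']+", text.lower())
    positive_hits = sum(word in POSITIVE_WORDS for word in words)
    negative_hits = sum(word in NEGATIVE_WORDS for word in words)
    if positive_hits > negative_hits:
        return "positive"
    if negative_hits > positive_hits:
        return "negative"
    return "neutral"  # tie or no lexicon words; not a class used in the paper


# Made-up example tweets.
tweets = [
    "The food was delicious and the staff were friendly",
    "Terrible service, the soup was cold",
    "Visited the resort today",
]
labels = [classify_tweet(tweet) for tweet in tweets]
print(labels)
```

A word-count approach like this ignores negation ("not good") and sarcasm, so a trained classifier would normally be preferred when labeled tweets are available.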
4. RESULTS
The following sections discuss the experimental results.
4.1 Web Crawling
Web crawling is applied to the website of Apoorva
Resorts, Davangere, and the HTML page is crawled. This is
shown in Figure 9.
Fig-9: Crawled HTML Page
4.2 Extraction of All Images
Images of Apoorva Resorts are extracted from the
crawled HTML page. This is shown in Figure 10.
Fig-10: Image Extraction from the Crawled HTML Page
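
Image extraction from a crawled HTML page can be sketched as follows. This is an illustrative assumption rather than the code used in the paper: it collects the src attribute of every img tag and resolves it against the page address, using only the Python standard library. The names ImageParser and extract_images and the sample markup are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ImageParser(HTMLParser):
    """Collects absolute image URLs from the <img> tags of one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.image_urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        src = dict(attrs).get("src")
        if src:
            # Relative paths are resolved against the address of the page.
            absolute = urljoin(self.base_url, src)
            if absolute not in self.image_urls:
                self.image_urls.append(absolute)


def extract_images(base_url, html):
    """Return the image URLs found in html, in order, without duplicates."""
    parser = ImageParser(base_url)
    parser.feed(html)
    return parser.image_urls


# Made-up markup standing in for a crawled page.
sample_html = (
    '<img src="images/pool.jpg">'
    '<img src="/images/room.jpg" alt="Room">'
    '<img alt="no source">'
)
images = extract_images("http://example.test/gallery/index.html", sample_html)
print(images)
```

Downloading and storing the image files themselves would be a separate step that takes this list of addresses as input.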
4.3 Tweets Analysis
The user posts tweets about the business; these
tweets may be positive or negative. After posting, the tweets
are loaded into the user's Twitter account. After a particular
item is entered for analysis, the system shows the total
number of tweets that belong to that category, as well as the
number of positive or negative tweets about the business,
with the accuracy. These procedures are shown in the
following figures.
Fig-11: Posting of Tweets
Fig-12: Loading of Tweets into user Twitter account
Fig-13: Selecting of particular Category
Fig-14: Analysis of Tweets of particular category
Fig-15: Analysis of posted Tweets
Fig-16: Analysis of posted Tweets with accuracy
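
The paper reports tweet counts "with accuracy" but does not define how the accuracy is computed. One common reading, assumed here for illustration only, is the fraction of tweets whose predicted label matches a hand-assigned reference label. The function name summarize and the sample labels below are hypothetical.

```python
from collections import Counter


def summarize(predicted, reference):
    """Count predicted labels and compare them with reference labels.

    Accuracy is taken to be (number of matching labels) / (number of tweets),
    which is an assumption about what the paper means by accuracy.
    """
    if not predicted or len(predicted) != len(reference):
        raise ValueError("label lists must be non-empty and equal in length")
    counts = Counter(predicted)
    correct = sum(p == r for p, r in zip(predicted, reference))
    return {
        "total": len(predicted),
        "positive": counts["positive"],
        "negative": counts["negative"],
        "accuracy": correct / len(predicted),
    }


# Made-up labels: output of a classifier and labels assigned by hand.
predicted = ["positive", "positive", "negative", "negative", "positive"]
reference = ["positive", "negative", "negative", "negative", "positive"]
summary = summarize(predicted, reference)
print(summary)
```

With these sample labels, four of the five predictions match the reference, so the reported accuracy is 0.8.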
5. CONCLUSION AND FUTURE WORK
The knowledge base of small businesses is built by
considering static and dynamic data. The static data includes
the business name, location, contact and website address.
Web crawling is applied to the static data, and Twitter is
used for the dynamic data. However, user comments on
social media usually do not have ratings that indicate how
positive or negative the reactions are, so the comments or
reviews are analyzed. The tweets users post to Twitter are
analyzed to determine how positive or negative the
reactions about the businesses are. User opinions are thus
useful for understanding users' preferences and the
reputations of the places or services they used.
For future work, there is also a need to make sure
the data are reliable, since there can always be fake reviews
from users who write reviews with malicious intent or who
are hired by businesses to write glowing reviews on social
media. Sometimes information from users or websites can be
out of date or inconsistent. These factors should be
considered in the future to improve the reliability of the
knowledge discovery system.
REFERENCES
[1] Edd Dumbill, "Volume, Velocity, Variety: What You Need
to Know About Big Data," Forbes, Jan. 2012.
[2] Brian Headd and Bruce Kirchhoff, "The growth, decline
and survival of small businesses: An exploratory study of
life cycles," Journal of Small Business Management, pp.
531-550, October 2009.
[3] R. A. Akerkar and P. S. Sajja, Knowledge-Based Systems.
Jones & Bartlett Publishers, Sudbury, MA, USA, 2009.
[4] Ravindra Changala, D. Rajeswara Rao, T. Janardhana Rao,
P. Kiran Kumar and Kareemunnisa, "Knowledge
Discovery Process: The Next Step for Knowledge Search,"
2015.
[5] F. Morstatter, S. Kumar, H. Liu and R. Maciejewski,
"Understanding Twitter Data with TweetXplorer," in
Proceedings of the 2013 ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining,
ACM, 2013.
[6] Raheleh Makki, Axel J. Soto and Stephen Brooks,
"Twitter message recommendation based on user
interest profiles," 2016.
[7] Ayoub Mohamed Elyasir and Kalaiarasi Sonai Muthu,
"Focused Web Crawler," International Conference on
Information and Knowledge Management (ICIKM 2012).
[8] Hemlata Sahu, Shalini Shrma and Seema Gondhalakar,
"A Brief Overview on Data Mining Survey," International
Journal of Computer Technology and Electronics
Engineering (IJCTEE), Volume 1, Issue 3, 2008.

A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
IRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
IRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
IRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
IRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
nooriasukmaningtyas
 
Embedded machine learning-based road conditions and driving behavior monitoring
Embedded machine learning-based road conditions and driving behavior monitoringEmbedded machine learning-based road conditions and driving behavior monitoring
Embedded machine learning-based road conditions and driving behavior monitoring
IJECEIAES
 
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdfIron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
RadiNasr
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
Victor Morales
 
ISPM 15 Heat Treated Wood Stamps and why your shipping must have one
ISPM 15 Heat Treated Wood Stamps and why your shipping must have oneISPM 15 Heat Treated Wood Stamps and why your shipping must have one
ISPM 15 Heat Treated Wood Stamps and why your shipping must have one
Las Vegas Warehouse
 
Casting-Defect-inSlab continuous casting.pdf
Casting-Defect-inSlab continuous casting.pdfCasting-Defect-inSlab continuous casting.pdf
Casting-Defect-inSlab continuous casting.pdf
zubairahmad848137
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
Aditya Rajan Patra
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
Madan Karki
 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
IJNSA Journal
 
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
ihlasbinance2003
 
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
171ticu
 
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
jpsjournal1
 
International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...
gerogepatton
 
ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024
Rahul
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
VICTOR MAESTRE RAMIREZ
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
Madan Karki
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
bijceesjournal
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
IJECEIAES
 
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEMTIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
HODECEDSIET
 

Recently uploaded (20)

A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
 
Embedded machine learning-based road conditions and driving behavior monitoring
Embedded machine learning-based road conditions and driving behavior monitoringEmbedded machine learning-based road conditions and driving behavior monitoring
Embedded machine learning-based road conditions and driving behavior monitoring
 
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdfIron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
Iron and Steel Technology Roadmap - Towards more sustainable steelmaking.pdf
 
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressionsKuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
KuberTENes Birthday Bash Guadalajara - K8sGPT first impressions
 
ISPM 15 Heat Treated Wood Stamps and why your shipping must have one
ISPM 15 Heat Treated Wood Stamps and why your shipping must have oneISPM 15 Heat Treated Wood Stamps and why your shipping must have one
ISPM 15 Heat Treated Wood Stamps and why your shipping must have one
 
Casting-Defect-inSlab continuous casting.pdf
Casting-Defect-inSlab continuous casting.pdfCasting-Defect-inSlab continuous casting.pdf
Casting-Defect-inSlab continuous casting.pdf
 
Recycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part IIIRecycled Concrete Aggregate in Construction Part III
Recycled Concrete Aggregate in Construction Part III
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
 
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMSA SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
A SYSTEMATIC RISK ASSESSMENT APPROACH FOR SECURING THE SMART IRRIGATION SYSTEMS
 
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
5214-1693458878915-Unit 6 2023 to 2024 academic year assignment (AutoRecovere...
 
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
 
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
 
International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...
 
ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024ACEP Magazine edition 4th launched on 05.06.2024
ACEP Magazine edition 4th launched on 05.06.2024
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
 
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEMTIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
TIME DIVISION MULTIPLEXING TECHNIQUE FOR COMMUNICATION SYSTEM
 

Knowledge Discovery of Small Business Domain using Web Crawling and Data Mining

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 04 Issue: 08 | Aug -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 1149

Knowledge Discovery of Small Business Domain Using Web Crawling and Data Mining

Latha M1, Shivanand R D2
1M.Tech. Department of Computer Science and Engineering, Bapuji Institute of Technology, Davanagere, Karnataka, India.
2Associate Professor, Department of Computer Science and Engineering, Bapuji Institute of Technology, Davanagere, Karnataka, India.

Abstract - Nowadays almost all information is available on the web, yet there are many obstacles to using the web effectively. Because so much information is available, users often do not find the information that matches their requirements. Among businesses, many small businesses generally go unnoticed by the public. The static features of a small business are its business name, area, contact details and website address. These static features are easily obtained by people. The dynamic features of a business include its reputation, but the reputation of a small business is very difficult to obtain because there are few means of encouraging users to write remarks about its services. This project focuses on developing a knowledge base of small businesses and on giving users valuable information about a business. Both static and dynamic data are considered in developing the knowledge base. Web crawling is applied to the static data, and Twitter is used as the dynamic data source. Users post tweets about the business, and the posted tweets are analyzed to determine whether they contain positive or negative responses about the business.
Key Words: Small business, data analysis, web crawling, data mining, knowledge base, knowledge discovery

1. INTRODUCTION

A large amount of information on almost any subject is currently available on the web, but this information has to be sorted out. Big data [1] is defined by three main features: capacity (volume), speed (velocity) and diversity (variety). These features make it difficult to use the web as a source in a well-organized manner, and most users do not get the information they require from it. In addition, most web content is written in HTML, the standard markup language of the web. HTML is intended for web programs to parse so that they can draw graphical layouts. These layouts mostly contain text, links and images, and sometimes other media content. Since the main purpose of HTML is to display data on screen, it is difficult for computer programs to retrieve useful data from HTML content without additional language processing.

Against this background, the information on the web is growing quickly. The performance of a small business improves when the business is run efficiently, and developing a well-organized small business requires good knowledge. Small businesses [2] are privately or independently owned organizations. A small business is limited in size and has only a few employees. Its annual returns are lower than those of other businesses. One advantage of a small business is that a small amount of money is enough to start it. Individuality is another advantage: small business owners are able to manage the business by themselves. The routines of a small business are mainly geared to meeting user requirements.
Organizations that embrace this marketing concept are likely to succeed when it is combined with growth and differentiation strategies. The market-oriented strategic stance of successful small businesses is guided by the requirements of customer-driven everyday activities. Small business success depends mainly on handling customer focus and market positioning. If a small business handles these two aspects well, it is likely to be successful both now and in the future.

With this in mind, this work develops a knowledge base [3] of small businesses and delivers information about a business to users. Nowadays many businesses are developing rapidly, but small businesses are not easily noticed by people. The static data of a small business, once stored, do not change over time. These data are the business name, area, contact details and website address, and they are easily available to the public. Dynamic data, in contrast, change over time. The dynamic features of a business are user opinions about its services or administration. These features are very difficult to retrieve because businesses lack methods for encouraging users to write remarks about their services. Examples of small businesses are hotels, stores, restaurants, bakeries and shops. In this project a restaurant is taken as the small business example.
  • 2. Food and beverages are primary human needs, and restaurants provide them. For this reason the restaurant business is a lasting business that is unlikely to lose its attraction for customers. If the owner has the ability to manage and run the restaurant well, the restaurant can grow quickly. Some people think that providing good-quality food and a pleasant atmosphere is enough to make a business successful, but many factors besides the taste of the food contribute to the success of a restaurant.

Because the reputation of a small business is very difficult to retrieve, social media is used as the source of reputation. Nowadays people commonly post status updates about their daily life on social media, usually in the form of comments. These comments describe where they spent their time, in which hotel they ate, how the food tasted and what the environment of the hotel was like. Such status updates usually express simply a positive or negative opinion about the service received from a particular organization. Comments posted on social media are written in natural language and do not carry any rating, so data mining techniques are required to increase the accuracy of the information.

This project focuses on knowledge discovery [4] for small businesses. First, the static data of the business, namely its website address, is considered, with Twitter [5] as the dynamic data source. Web crawling is applied to the static data, and information is then extracted from the corresponding web page.
The user is then allowed to post comments about a particular business, and the posted comments are analyzed to determine whether the tweets [6] contain a positive or negative response about the business. If the comments are mostly positive, the system gives the total number of positive tweets, with accuracy, for that business. If the comments are mostly negative, it gives the total number of negative tweets, with accuracy, for that business. Some kind of data analysis [7] is therefore needed to determine how positive or negative each tweet is about the business.

1.1 Web Crawling

Web crawling [7] is the process by which search engines comb through web pages in order to index them properly. Web crawling makes it easier for a search engine to return the most relevant results to users after they enter a search query.

1.2 Data Mining

Data mining [8] is the automatic or semi-automatic analysis of large quantities of data to extract previously unknown, interesting patterns such as groups of data records, unusual records and dependencies.

2. SYSTEM ARCHITECTURE

Fig -1: Knowledge Discovery Process

Figure 1 gives an overview of the overall knowledge discovery process. It involves web crawling on the static data source; the images and information extracted from the retrieved page are stored in the knowledge base. Twitter is the dynamic data source, and the tweets posted by users are analyzed with accuracy.

3. RELATED WORK

The following sections describe the related work for this paper. For knowledge discovery of small businesses, both static and dynamic data are considered.

3.1 Web Crawling On Static Data

The initial stage is the collection of static data. Static data are defined as data that are not produced by users and are likely to remain unchanged for a long time. These data include business names, locations, contacts and types of business.
In this phase web crawling is applied to the static data of the business to produce the knowledge base of small businesses. Images and information are extracted from the crawled page and stored in the knowledge base. The website address of Apoorva Resorts, located in Davanagere, is http://www.apoorvaresortsdavangere.in. The following figure shows web crawling on the static data.
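The crawl-and-extract step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the paper does not say which language or library was used, and the names PageExtractor, extract_page and crawl are hypothetical. The sketch uses only the Python standard library, collects image URLs and visible text from one page, and returns them as a record that could be stored in a knowledge base.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PageExtractor(HTMLParser):
    """Collects image URLs and visible text from one HTML page (hypothetical helper)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.images = []
        self.text_parts = []
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "img":
            src = dict(attrs).get("src")
            if src:
                # Resolve relative image paths against the page URL
                self.images.append(urljoin(self.base_url, src))

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is not script or style content
        if self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


def extract_page(html, base_url):
    """Return a knowledge-base record (assumed layout) for one crawled page."""
    parser = PageExtractor(base_url)
    parser.feed(html)
    parser.close()
    return {
        "url": base_url,
        "images": parser.images,
        "text": " ".join(parser.text_parts),
    }


def crawl(url):
    """Fetch a page over HTTP and extract it; requires network access."""
    from urllib.request import urlopen

    with urlopen(url, timeout=10) as resp:
        html = resp.read().decode("utf-8", errors="replace")
    return extract_page(html, url)
```

Calling crawl with the business website address would return one such record. A fuller crawler would also follow links and respect robots.txt, which this sketch leaves out.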
  • 3. Fig -2: Web Crawling Process

3.2 Tweets Analysis

In this phase a user Twitter account is first created. A Twitter application is then created in order to generate the Twitter API keys, access token and secret keys; creating a Twitter application is mandatory for accessing Twitter. It involves the following steps.

1. Visit the website https://dev.twitter.com/apps/new, log in to the Twitter account and click Create New App. This is shown in Figure 3 and Figure 4.

Fig-3: Home Page of Twitter Developer Platform

Fig-4: Login Page of Twitter

2. On the Create an application page, enter the requested information for the new Twitter application: the application name, description and user website address. The application name must not already be taken by another user. This is shown in Figure 5.

3. The Twitter application is created. On the main page, click Keys and Access Tokens. The Application Settings section contains the Consumer Key and Consumer Secret. This is shown in Figure 6.

4. Under the Your Access Token section, click Create my access token. The access tokens are generated and ready to be used. This is shown in Figure 7.

Fig-5: Creation of Twitter Application Details
Fig-6: Consumer Key and Consumer Secret
Fig-7: Access Token
5. The Twitter authentication information consists of the following elements: Consumer Key, Consumer Secret, Access Token and Access Token Secret. All of these are needed to access Twitter programmatically.
Finally, for the tweet analysis, the user's tweets are posted and loaded, and the tweets are classified as either positive or negative, with the accuracy reported. The entire tweet analysis is shown in Figure 8.
Fig-8-: Tweets Analysis

4. RESULTS
The following sections discuss the experimental results.

4.1 Web Crawling
Web crawling is applied to the website of Apoorva resorts, Davangere, and the HTML page is crawled. This is shown in Figure 9.
Fig-9: Crawled HTML Page

4.2 Extraction of All Images
Images of Apoorva resorts are extracted from the crawled HTML page. This is shown in Figure 10.
Fig-10: Image Extraction from the Crawled HTML Page

4.3 Tweets Analysis
In this step, users post tweets about the business; these tweets may be positive or negative. Once posted, the tweets are loaded into the user's Twitter account. After the user enters a particular item for analysis, the system shows the total number of tweets that belong to that category, along with the number of positive or negative tweets about the business and the accuracy. These steps are shown in the following figures.
Fig-11: Posting of Tweets
Fig-12: Loading of Tweets into user Twitter account
Fig-13: Selecting of particular Category
Fig-14: Analysis of Tweets of particular category
Fig-15: Analysis of posted Tweets
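The counting of positive and negative tweets described in Section 4.3 can be sketched as follows. The paper does not name the classifier it uses or define how the accuracy is computed, so this example makes two assumptions: tweets are labeled with a simple word-list (lexicon) approach, and accuracy is the fraction of tweets whose predicted label matches a hand-assigned label. The word lists, the sample tweets and the function names classify and analyze are all hypothetical.

```python
import re

# Tiny illustrative word lists (assumptions, not taken from the paper).
POSITIVE_WORDS = {"good", "great", "clean", "friendly", "excellent"}
NEGATIVE_WORDS = {"bad", "dirty", "rude", "slow", "poor"}


def classify(tweet):
    """Label a tweet 'positive' or 'negative' by counting lexicon hits."""
    words = re.findall(r"[a-z']+", tweet.lower())
    score = sum(w in POSITIVE_WORDS for w in words) - sum(
        w in NEGATIVE_WORDS for w in words
    )
    # Ties fall back to 'positive' so that the output stays two-class.
    return "positive" if score >= 0 else "negative"


def analyze(labeled_tweets):
    """Return counts per predicted class and accuracy against the given labels."""
    counts = {"positive": 0, "negative": 0}
    correct = 0
    for text, true_label in labeled_tweets:
        predicted = classify(text)
        counts[predicted] += 1
        correct += predicted == true_label
    accuracy = correct / len(labeled_tweets) if labeled_tweets else 0.0
    return counts, accuracy


# Made-up tweets with hand-assigned labels, for illustration only.
TWEETS = [
    ("Great food and friendly staff", "positive"),
    ("Rooms were dirty and service was slow", "negative"),
    ("Excellent pool, clean rooms", "positive"),
    ("Not good at all", "negative"),
]

counts, accuracy = analyze(TWEETS)
print(counts, accuracy)
```

The last sample tweet is misclassified because a plain word list ignores negation ("not good"), which is one reason a trained classifier would likely be preferred in practice.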
Fig-16: Analysis of posted Tweets with accuracy

5. CONCLUSION AND FUTURE WORK
The knowledge base of small businesses is built by considering both static and dynamic data. The static data include the business name, location, contact and website address of each business, and web crawling is applied to these static data. Twitter is used as the dynamic data source. User comments on social media usually do not carry ratings that indicate how positive or negative the reactions are, so the comments or reviews themselves are analyzed. The tweets that users post on Twitter are analyzed to determine how positive or negative the reactions to the businesses are. User opinions are thus useful for understanding users' preferences and the reputations of the places or services they have used.
For future work, there is also a need to make sure that the data are reliable, since there can always be fake reviews from users who write with malicious intent or who are hired by businesses to write glowing reviews on social media. In addition, information from users or websites can sometimes be out of date or inconsistent. These factors should be considered in the future to improve the reliability of the knowledge discovery system.

REFERENCES
[1] Edd Dumbill, "Volume, Velocity, Variety: What You Need to Know About Big Data," Forbes, Jan. 2012.
[2] Headd, Brian and Bruce Kirchhoff, "The growth, decline and survival of small businesses: An exploratory study of life cycles," Journal of Small Business Management, pp. 531-550, October 2009.
[3] Akerkar, R.A. and Sajja, P.S. 2009. Knowledge-based systems. Jones & Bartlett Publishers, Sudbury, MA, USA.
[4] Ravindra Changala, D. Rajeswara Rao, T. Janardhana Rao, P. Kiran Kumar, Kareemunnisa, 2015. "Knowledge Discovery Process: The Next Step for Knowledge Search."
[5] F. Morstatter, S. Kumar, H. Liu, and R. Maciejewski. Understanding Twitter Data with TweetXplorer. In Proceedings of the 2013 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, 2013.
[6] Raheleh Makki, Axel J. Soto, Stephen Brooks, 2016. Twitter message recommendation based on user interest profiles.
[7] Elyasir, Ayoub Mohamed, and Kalaiarasi Sonai Muthu. "Focused Web Crawler." International Conference on Information and Knowledge Management (ICIKM 2012).
[8] Hemlata Sahu, Shalini Shrma, Seema Gondhalakar. "A Brief Overview on Data Mining Survey," 2008, International Journal of Computer Technology and Electronics Engineering (IJCTEE), Volume 1, Issue 3.