SlideShare a Scribd company logo
1 of 3
Download to read offline
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 04 | Apr 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1569
Building Your Own Search Engine
Mansi Vinzani1, Divya Salian2, Priyesh Bapna3, Prof. Sneha Bendale4
1,2,3Student, Department of Computer Engineering, Terna Engineering College, Maharashtra, India
4Professor, Department of Computer Engineering, Terna Engineering College, Maharashtra,, India
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract –21st Century being an age of digitalworld makes
every information available online and to be able to access
such huge amount of information on variety of subjects is only
possible by having a search engine. We have so many search
engines available by now but the most prominent search
engine in the world right now is ‘Google Search Engine’. Every
other search engine is different from thatofothers. Thereason
for this difference is primarily the use of algorithms or the
combination of algorithms in use. Everyalgorithmwillhave its
own pros and cons. Our project titled as ‘Building our own
search engine’ is based on the concept of web-crawling and
indexing. The implementation of these 2 will form the major
chunk of our project. Our web-crawler (spider/bot) will be
essentially sorting through the Internet to find website
addresses and the contents of a website for storage in the
search engine database. Then comes the job of our Indexer
which will be indexing the content based on the occurrence of
keyword phrases in each individual website. Also, our search
engine would store the web content within the database
essential for fast and easy searching. Finally the output of our
project are the hyperlinks to websites that show up in the
search engine page when a certain keyword or phrase is
queried.
Key Words: Crawler, Heap sorting, Page Scoring,
Breadth first search.
1. INTRODUCTION
A search engine is a web-based tool which allows the
internet users to find information on the internet. Most
commonly used search engines are Google, Yahoo!, MSN,
Bing, Ask etc. Search engines are special types of programs
used to search documents having specified keywords and
returns a list of documents where the keywords are located.
A search engine is usually a general collection of programs.
However, the term ‘search engine’ is often used to generally
describe the common systems like Google, Bing and Yahoo!
Search engines generally use automated software
applications e.g. robots or spiders which moves across the
Web and follows the links from page to page, site to site. A
typical Web Search Engine starts working by sending out a
spider which has the ability to fetch as many documents as
possible against the supplied keywords. There is another
program called the indexer, and then starts reading these
documents and starts creating the indices based on the
tokens found in each document. Each search engine uses its
own proprietary algorithm to create the indices in such a
way that ideally, only meaningful results are returned for
each query. A typical search engine consists of few parts –
crawler which is used to pull external documents -An index
which is the place where the documents are stored in an
inverted tree and a document store to keep the documents.
2. LITERATURE SURVEY
[1] A survey on Search Engine Optimization is done. Survey
on different search engines are made to check how user
friendly, stable, secure and fast the search engine is.
Different optimization techniques are usedsuchasKeyword
Search, Web optimization, Social Media, Blogs, Forums,
Articles, Analytics and Press Releases.
[2]Different parts of search engines are explained. Basic
Features of search engine is explained. Types of search
engines such as Search directories or indexes,Hybridsearch
engines and Meta search engine work isexplained.Thereare
three important parts of search engine i.e.SpiderorCrawler,
Index, catalog or database and Search Engine Software.
[3]Major Search Engines like Google, Yahoo and Bing are
being compared. Different techniques and comparisons are
made. Speed, Accuracy, Security, Related Content, and no of
results which are useful are compared. Google and Yahoo
turns out to be the best but when it comes to no of useful
results, Google turns out to be the best.
[4]A search engine to support web terrorism mining offers
un identifies and precious data for the Law enforcement
organization incessantly. The process to tackle examining
the criminal believes of forensic data investigationregarding
prioritizes capable links andpageswhichwrapthe reliability
and consistency gap by offering a framework. This argue
with variety of open issues in this area.
[5]Bee Swarm algorithm is used instead of regular breadth
first search algorithm in making of this search engine.
Swarm intelligence is used to crawl the web instead of
breadth first search. It works like a bee system to collect
search results fast as well as accurate.
[6]Web mining and data mining techniquesisusedtomakea
personalized web search engine. Data overloadproblemcan
be overcome using web mining and data mining techniques.
Search can be made Classifies as well as Personalized. Web
personalization is done. Strategies of web personalization
and their related work is made.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 04 | Apr 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1570
3. METHODOLOGY
Methodology is divided into four parts as follows:
3.1. Admin Process
Admin Login: Admin login into the system using username
and password. After successful login, admin can access the
crawler.
Dashboard: On this window search bar for query isavailable
and there are two more buttons available i.e. ‘Extract top 20
links’ and ‘Extract Max URLS’. Extract max URLs will show
the URL with the highest page score.
3.2. User Query
User is provided by a search bar in which user types the
keyword that he/she wants to have information about. The
result will be displayed to the user in the form of URL links.
First 30 links on the basis of page score will be displayed.
3.3. Web Crawler
The web crawler begins and works independently. To
initiate the crawler we need to provide the seed URL linksto
it. Tracing which it can independently download the
websites. The web crawler goes from one link toanotherina
breadth first search manner sothatall primaryrelevantsites
are crawled first.
3.4. Indexer
After the web crawler has crawled the websites, it will be
saving it into a database. An online database is used for this
purpose.
3.5. Page Scoring
Page Scoring is done with the help of calculating 2 scores i.e.
Keyword score and link score. Keyword score is on the basis
of occurrence of that keyword on the webpage as to how
many times that was word has been encountered in the
website. Word count of that keyword is recorded. Second
one is by calculating Link score. Link score is determined by
tracing all the href tags on the webpage to get the count of
outgoing links. This is also recorded and kept. Now the page
score is calculated by adding the above two scores.
3.6. Searching Interface Steps :
Step 1: Start
Step 2: Admin Login
Step 3: Enter the keyword for your search.
Step 4: Hit the search button
Step 5: Result displays 30 URLs
Step 6: Hit the button ‘Top 20 links”
Step 7: Result displays top 20 links on the basisofpagescore
Step 8: Hit the button ‘Extract max URLs’
Step 9: Result will display one such link which has the
highest page score.
4. FLOW CHART OF THE SYSTEM
4.1. Flowchart
5. RESULTS
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 04 | Apr 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1571
6. CONCLUSION
Search Engine which is one of the most powerful tool crawls
up so many websites to bring up the result on the display
screen. Every search engine will have different results being
displayed on the top because of the primary reason of using
different algorithms to crawl and ranking the pages. Further
by optimizing the search algorithm more relevant websites
can be displayed on the top, increasing the efficiency of the
search.
7. REFERENCES
1] Prashant Ankalkoti “Survey on Search Engine
Optimization”, Imperial Journal of Interdisciplinary
research, vol – 3, issue – 5, 2017.
[2] W.Bruce croft, Donald Metzler, Trevor Strohman “
Search Engines”, Pearson Education Inc, 2015.
[3]BhupendraRatha“SearchEngine”,SchoolofLibrary
and Information Science.
[4] Amim SHahraki, Javad Hosseinkhan, Surayti
Chuprat “Applying Social Network Analysis in Crawler
Based Search Engine”, International Journal of
Computer Science and Network Security,VOL.17No.8,
August 2017.
[5] Mohammed Ibrahim Sahujaa, Ahmed bahaa Ulddin
“Building WebCrawler Based onBeeSwanAlgorithm”,
International Journal of Computer Science and
Network Secrity, Vol. 10, Issue 5, No, 1, September
2013.
[6] K. Ppoornsiri, S.Radha Priya “A Literature Review
on Personalized WebSearch”, ResearchDepartmentof
Computer Science, Coimbatore, India.

More Related Content

What's hot

Detailed study of aggregator for updates
Detailed study of aggregator for updatesDetailed study of aggregator for updates
Detailed study of aggregator for updateseSAT Journals
 
SEARCH ENGINE BY SAIKIRAN PANJALA
SEARCH ENGINE BY SAIKIRAN PANJALASEARCH ENGINE BY SAIKIRAN PANJALA
SEARCH ENGINE BY SAIKIRAN PANJALASaikiran Panjala
 
Detailed study of aggregator for updates
Detailed study of aggregator for updatesDetailed study of aggregator for updates
Detailed study of aggregator for updateseSAT Publishing House
 
Classification of search_engine
Classification of search_engineClassification of search_engine
Classification of search_engineBookStoreLib
 
Network paperthesis2
Network paperthesis2Network paperthesis2
Network paperthesis2Dhara Shah
 
A Study on SEO Monitoring System Based on Corporate Website Development
A Study on SEO Monitoring System Based on Corporate Website DevelopmentA Study on SEO Monitoring System Based on Corporate Website Development
A Study on SEO Monitoring System Based on Corporate Website Developmentijcseit
 
Quest Trail: An Effective Approach for Construction of Personalized Search En...
Quest Trail: An Effective Approach for Construction of Personalized Search En...Quest Trail: An Effective Approach for Construction of Personalized Search En...
Quest Trail: An Effective Approach for Construction of Personalized Search En...Editor IJCATR
 
BLOSEN: BLOG SEARCH ENGINE BASED ON POST CONCEPT CLUSTERING
BLOSEN: BLOG SEARCH ENGINE BASED ON POST CONCEPT CLUSTERINGBLOSEN: BLOG SEARCH ENGINE BASED ON POST CONCEPT CLUSTERING
BLOSEN: BLOG SEARCH ENGINE BASED ON POST CONCEPT CLUSTERINGijasa
 
Using Exclusive Web Crawlers to Store Better Results in Search Engines' Database
Using Exclusive Web Crawlers to Store Better Results in Search Engines' DatabaseUsing Exclusive Web Crawlers to Store Better Results in Search Engines' Database
Using Exclusive Web Crawlers to Store Better Results in Search Engines' DatabaseIJwest
 
How a search engine works report
How a search engine works reportHow a search engine works report
How a search engine works reportSovan Misra
 
Meta Search Engines
Meta Search EnginesMeta Search Engines
Meta Search Enginesvcsstudent
 
Internet Tutorial 03
Internet  Tutorial 03Internet  Tutorial 03
Internet Tutorial 03dpd
 
Review of the machine learning methods in the classification of phishing attack
Review of the machine learning methods in the classification of phishing attackReview of the machine learning methods in the classification of phishing attack
Review of the machine learning methods in the classification of phishing attackjournalBEEI
 
IJSRED-V2I4P0
IJSRED-V2I4P0IJSRED-V2I4P0
IJSRED-V2I4P0IJSRED
 
IRJET- Effective Countering of Communal Hatred During Disaster Events in Soci...
IRJET- Effective Countering of Communal Hatred During Disaster Events in Soci...IRJET- Effective Countering of Communal Hatred During Disaster Events in Soci...
IRJET- Effective Countering of Communal Hatred During Disaster Events in Soci...IRJET Journal
 
How search engine works
How search engine worksHow search engine works
How search engine worksAshraf Ali
 
Metasearch engines
Metasearch enginesMetasearch engines
Metasearch enginesHamza Saeed
 

What's hot (20)

Detailed study of aggregator for updates
Detailed study of aggregator for updatesDetailed study of aggregator for updates
Detailed study of aggregator for updates
 
SEARCH ENGINE BY SAIKIRAN PANJALA
SEARCH ENGINE BY SAIKIRAN PANJALASEARCH ENGINE BY SAIKIRAN PANJALA
SEARCH ENGINE BY SAIKIRAN PANJALA
 
Detailed study of aggregator for updates
Detailed study of aggregator for updatesDetailed study of aggregator for updates
Detailed study of aggregator for updates
 
Classification of search_engine
Classification of search_engineClassification of search_engine
Classification of search_engine
 
Network paperthesis2
Network paperthesis2Network paperthesis2
Network paperthesis2
 
A Study on SEO Monitoring System Based on Corporate Website Development
A Study on SEO Monitoring System Based on Corporate Website DevelopmentA Study on SEO Monitoring System Based on Corporate Website Development
A Study on SEO Monitoring System Based on Corporate Website Development
 
Quest Trail: An Effective Approach for Construction of Personalized Search En...
Quest Trail: An Effective Approach for Construction of Personalized Search En...Quest Trail: An Effective Approach for Construction of Personalized Search En...
Quest Trail: An Effective Approach for Construction of Personalized Search En...
 
Meta Search Engine: An Introductory Study
Meta Search Engine: An Introductory StudyMeta Search Engine: An Introductory Study
Meta Search Engine: An Introductory Study
 
Gg3411451152
Gg3411451152Gg3411451152
Gg3411451152
 
BLOSEN: BLOG SEARCH ENGINE BASED ON POST CONCEPT CLUSTERING
BLOSEN: BLOG SEARCH ENGINE BASED ON POST CONCEPT CLUSTERINGBLOSEN: BLOG SEARCH ENGINE BASED ON POST CONCEPT CLUSTERING
BLOSEN: BLOG SEARCH ENGINE BASED ON POST CONCEPT CLUSTERING
 
Meta search engine
Meta search engineMeta search engine
Meta search engine
 
Using Exclusive Web Crawlers to Store Better Results in Search Engines' Database
Using Exclusive Web Crawlers to Store Better Results in Search Engines' DatabaseUsing Exclusive Web Crawlers to Store Better Results in Search Engines' Database
Using Exclusive Web Crawlers to Store Better Results in Search Engines' Database
 
How a search engine works report
How a search engine works reportHow a search engine works report
How a search engine works report
 
Meta Search Engines
Meta Search EnginesMeta Search Engines
Meta Search Engines
 
Internet Tutorial 03
Internet  Tutorial 03Internet  Tutorial 03
Internet Tutorial 03
 
Review of the machine learning methods in the classification of phishing attack
Review of the machine learning methods in the classification of phishing attackReview of the machine learning methods in the classification of phishing attack
Review of the machine learning methods in the classification of phishing attack
 
IJSRED-V2I4P0
IJSRED-V2I4P0IJSRED-V2I4P0
IJSRED-V2I4P0
 
IRJET- Effective Countering of Communal Hatred During Disaster Events in Soci...
IRJET- Effective Countering of Communal Hatred During Disaster Events in Soci...IRJET- Effective Countering of Communal Hatred During Disaster Events in Soci...
IRJET- Effective Countering of Communal Hatred During Disaster Events in Soci...
 
How search engine works
How search engine worksHow search engine works
How search engine works
 
Metasearch engines
Metasearch enginesMetasearch engines
Metasearch engines
 

Similar to IRJET - Building Your Own Search Engine

Retrospect of Search Engine Optimization Techniques
Retrospect of Search Engine Optimization TechniquesRetrospect of Search Engine Optimization Techniques
Retrospect of Search Engine Optimization TechniquesIRJET Journal
 
Calculating Rank of Web Documents Using Its Content and Link Analysis
Calculating Rank of Web Documents Using Its Content and Link AnalysisCalculating Rank of Web Documents Using Its Content and Link Analysis
Calculating Rank of Web Documents Using Its Content and Link AnalysisIRJET Journal
 
Search engine and web crawler
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawlerishmecse13
 
Introduction To Internet Marketing
Introduction To Internet MarketingIntroduction To Internet Marketing
Introduction To Internet MarketingYashobanta Bai
 
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...inventionjournals
 
Introduction to Search Engine Optimization
Introduction to Search Engine OptimizationIntroduction to Search Engine Optimization
Introduction to Search Engine OptimizationGauravPrajapati39
 
Internet magazines practical tips for improving website traffic
Internet magazines  practical tips for improving website trafficInternet magazines  practical tips for improving website traffic
Internet magazines practical tips for improving website trafficHamideh Iraj
 
Search Engine Optimization (SEO)
Search Engine Optimization (SEO)Search Engine Optimization (SEO)
Search Engine Optimization (SEO)Nandu B Rajan
 
10 personalized-web-search-techniques
10 personalized-web-search-techniques10 personalized-web-search-techniques
10 personalized-web-search-techniquesdipanjalishipne
 
International conference On Computer Science And technology
International conference On Computer Science And technologyInternational conference On Computer Science And technology
International conference On Computer Science And technologyanchalsinghdm
 
Search Engine Optimization - Aykut Aslantaş
Search Engine Optimization - Aykut AslantaşSearch Engine Optimization - Aykut Aslantaş
Search Engine Optimization - Aykut AslantaşAykut Aslantaş
 
How search engine works
How search engine worksHow search engine works
How search engine worksAshraf Ali
 

Similar to IRJET - Building Your Own Search Engine (20)

Retrospect of Search Engine Optimization Techniques
Retrospect of Search Engine Optimization TechniquesRetrospect of Search Engine Optimization Techniques
Retrospect of Search Engine Optimization Techniques
 
G017254554
G017254554G017254554
G017254554
 
Calculating Rank of Web Documents Using Its Content and Link Analysis
Calculating Rank of Web Documents Using Its Content and Link AnalysisCalculating Rank of Web Documents Using Its Content and Link Analysis
Calculating Rank of Web Documents Using Its Content and Link Analysis
 
Rutuja SEO.pdf
Rutuja SEO.pdfRutuja SEO.pdf
Rutuja SEO.pdf
 
Search engine and web crawler
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawler
 
Seo report
Seo reportSeo report
Seo report
 
Introduction To Internet Marketing
Introduction To Internet MarketingIntroduction To Internet Marketing
Introduction To Internet Marketing
 
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
 
Touch With Industry
Touch With IndustryTouch With Industry
Touch With Industry
 
Lesson 4.pdf
Lesson 4.pdfLesson 4.pdf
Lesson 4.pdf
 
Introduction to Search Engine Optimization
Introduction to Search Engine OptimizationIntroduction to Search Engine Optimization
Introduction to Search Engine Optimization
 
Internet magazines practical tips for improving website traffic
Internet magazines  practical tips for improving website trafficInternet magazines  practical tips for improving website traffic
Internet magazines practical tips for improving website traffic
 
Search engine
Search engineSearch engine
Search engine
 
Search Engine Optimization (SEO)
Search Engine Optimization (SEO)Search Engine Optimization (SEO)
Search Engine Optimization (SEO)
 
Search engine
Search engineSearch engine
Search engine
 
Seo guide
Seo guideSeo guide
Seo guide
 
10 personalized-web-search-techniques
10 personalized-web-search-techniques10 personalized-web-search-techniques
10 personalized-web-search-techniques
 
International conference On Computer Science And technology
International conference On Computer Science And technologyInternational conference On Computer Science And technology
International conference On Computer Science And technology
 
Search Engine Optimization - Aykut Aslantaş
Search Engine Optimization - Aykut AslantaşSearch Engine Optimization - Aykut Aslantaş
Search Engine Optimization - Aykut Aslantaş
 
How search engine works
How search engine worksHow search engine works
How search engine works
 

More from IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

More from IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Recently uploaded

microprocessor 8085 and its interfacing
microprocessor 8085  and its interfacingmicroprocessor 8085  and its interfacing
microprocessor 8085 and its interfacingjaychoudhary37
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and usesDevarapalliHaritha
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ
 

Recently uploaded (20)

microprocessor 8085 and its interfacing
microprocessor 8085  and its interfacingmicroprocessor 8085  and its interfacing
microprocessor 8085 and its interfacing
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and uses
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
 

IRJET - Building Your Own Search Engine

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 04 | Apr 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1569 Building Your Own Search Engine Mansi Vinzani1, Divya Salian2, Priyesh Bapna3, Prof. Sneha Bendale4 1,2,3Student, Department of Computer Engineering, Terna Engineering College, Maharashtra, India 4Professor, Department of Computer Engineering, Terna Engineering College, Maharashtra,, India ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract –21st Century being an age of digitalworld makes every information available online and to be able to access such huge amount of information on variety of subjects is only possible by having a search engine. We have so many search engines available by now but the most prominent search engine in the world right now is ‘Google Search Engine’. Every other search engine is different from thatofothers. Thereason for this difference is primarily the use of algorithms or the combination of algorithms in use. Everyalgorithmwillhave its own pros and cons. Our project titled as ‘Building our own search engine’ is based on the concept of web-crawling and indexing. The implementation of these 2 will form the major chunk of our project. Our web-crawler (spider/bot) will be essentially sorting through the Internet to find website addresses and the contents of a website for storage in the search engine database. Then comes the job of our Indexer which will be indexing the content based on the occurrence of keyword phrases in each individual website. Also, our search engine would store the web content within the database essential for fast and easy searching. Finally the output of our project are the hyperlinks to websites that show up in the search engine page when a certain keyword or phrase is queried. Key Words: Crawler, Heap sorting, Page Scoring, Breadth first search. 1. INTRODUCTION A search engine is a web-based tool which allows the internet users to find information on the internet. Most commonly used search engines are Google, Yahoo!, MSN, Bing, Ask etc. Search engines are special types of programs used to search documents having specified keywords and returns a list of documents where the keywords are located. A search engine is usually a general collection of programs. However, the term ‘search engine’ is often used to generally describe the common systems like Google, Bing and Yahoo! Search engines generally use automated software applications e.g. robots or spiders which moves across the Web and follows the links from page to page, site to site. A typical Web Search Engine starts working by sending out a spider which has the ability to fetch as many documents as possible against the supplied keywords. There is another program called the indexer, and then starts reading these documents and starts creating the indices based on the tokens found in each document. Each search engine uses its own proprietary algorithm to create the indices in such a way that ideally, only meaningful results are returned for each query. A typical search engine consists of few parts – crawler which is used to pull external documents -An index which is the place where the documents are stored in an inverted tree and a document store to keep the documents. 2. LITERATURE SURVEY [1] A survey on Search Engine Optimization is done. Survey on different search engines are made to check how user friendly, stable, secure and fast the search engine is. Different optimization techniques are usedsuchasKeyword Search, Web optimization, Social Media, Blogs, Forums, Articles, Analytics and Press Releases. [2]Different parts of search engines are explained. Basic Features of search engine is explained. Types of search engines such as Search directories or indexes,Hybridsearch engines and Meta search engine work isexplained.Thereare three important parts of search engine i.e.SpiderorCrawler, Index, catalog or database and Search Engine Software. [3]Major Search Engines like Google, Yahoo and Bing are being compared. Different techniques and comparisons are made. Speed, Accuracy, Security, Related Content, and no of results which are useful are compared. Google and Yahoo turns out to be the best but when it comes to no of useful results, Google turns out to be the best. [4]A search engine to support web terrorism mining offers un identifies and precious data for the Law enforcement organization incessantly. The process to tackle examining the criminal believes of forensic data investigationregarding prioritizes capable links andpageswhichwrapthe reliability and consistency gap by offering a framework. This argue with variety of open issues in this area. [5]Bee Swarm algorithm is used instead of regular breadth first search algorithm in making of this search engine. Swarm intelligence is used to crawl the web instead of breadth first search. It works like a bee system to collect search results fast as well as accurate. [6]Web mining and data mining techniquesisusedtomakea personalized web search engine. Data overloadproblemcan be overcome using web mining and data mining techniques. Search can be made Classifies as well as Personalized. Web personalization is done. Strategies of web personalization and their related work is made.
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 04 | Apr 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1570 3. METHODOLOGY Methodology is divided into four parts as follows: 3.1. Admin Process Admin Login: Admin login into the system using username and password. After successful login, admin can access the crawler. Dashboard: On this window search bar for query isavailable and there are two more buttons available i.e. ‘Extract top 20 links’ and ‘Extract Max URLS’. Extract max URLs will show the URL with the highest page score. 3.2. User Query User is provided by a search bar in which user types the keyword that he/she wants to have information about. The result will be displayed to the user in the form of URL links. First 30 links on the basis of page score will be displayed. 3.3. Web Crawler The web crawler begins and works independently. To initiate the crawler we need to provide the seed URL linksto it. Tracing which it can independently download the websites. The web crawler goes from one link toanotherina breadth first search manner sothatall primaryrelevantsites are crawled first. 3.4. Indexer After the web crawler has crawled the websites, it will be saving it into a database. An online database is used for this purpose. 3.5. Page Scoring Page Scoring is done with the help of calculating 2 scores i.e. Keyword score and link score. Keyword score is on the basis of occurrence of that keyword on the webpage as to how many times that was word has been encountered in the website. Word count of that keyword is recorded. Second one is by calculating Link score. Link score is determined by tracing all the href tags on the webpage to get the count of outgoing links. This is also recorded and kept. Now the page score is calculated by adding the above two scores. 3.6. Searching Interface Steps : Step 1: Start Step 2: Admin Login Step 3: Enter the keyword for your search. Step 4: Hit the search button Step 5: Result displays 30 URLs Step 6: Hit the button ‘Top 20 links” Step 7: Result displays top 20 links on the basisofpagescore Step 8: Hit the button ‘Extract max URLs’ Step 9: Result will display one such link which has the highest page score. 4. FLOW CHART OF THE SYSTEM 4.1. Flowchart 5. RESULTS
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 04 | Apr 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1571 6. CONCLUSION Search Engine which is one of the most powerful tool crawls up so many websites to bring up the result on the display screen. Every search engine will have different results being displayed on the top because of the primary reason of using different algorithms to crawl and ranking the pages. Further by optimizing the search algorithm more relevant websites can be displayed on the top, increasing the efficiency of the search. 7. REFERENCES 1] Prashant Ankalkoti “Survey on Search Engine Optimization”, Imperial Journal of Interdisciplinary research, vol – 3, issue – 5, 2017. [2] W.Bruce croft, Donald Metzler, Trevor Strohman “ Search Engines”, Pearson Education Inc, 2015. [3]BhupendraRatha“SearchEngine”,SchoolofLibrary and Information Science. [4] Amim SHahraki, Javad Hosseinkhan, Surayti Chuprat “Applying Social Network Analysis in Crawler Based Search Engine”, International Journal of Computer Science and Network Security,VOL.17No.8, August 2017. [5] Mohammed Ibrahim Sahujaa, Ahmed bahaa Ulddin “Building WebCrawler Based onBeeSwanAlgorithm”, International Journal of Computer Science and Network Secrity, Vol. 10, Issue 5, No, 1, September 2013. [6] K. Ppoornsiri, S.Radha Priya “A Literature Review on Personalized WebSearch”, ResearchDepartmentof Computer Science, Coimbatore, India.