SlideShare a Scribd company logo
1 of 23
How Google search engine algorithm works 
Prepared by:- Viral Shah (120570107014) 
Guided by :- Prof. Sahista Machhar, MEFGI
It is a program that 
searches for and 
identifies items in a 
database that 
correspond to 
keywords or 
characters specified 
by the user, used 
especially for finding 
particular sites on the 
World Wide Web.
 There are 759 Million websites on the Web & 
60 Trillion webpages of this websites. 
 AND IT’S CONSTANTLY GROWING !!!!!
 GOOGLE navigates WEB by 
crawling. 
 To find information on the 
hundreds of millions of Web 
pages that exist, a search 
engine employs special 
software robots, called 
SPIDERS, to build lists of the 
words found on Web sites. 
When a spider is building its 
lists, the process is called 
Web crawling.
 The usual starting points are lists of heavily 
used servers and very popular pages. The 
spider will begin with a popular site, indexing 
the words on its pages and following every 
link found within the site. In this way, the 
spidering system quickly begins to travel, 
spreading out across the most widely used 
portions of the Web.
 When the Google spider looked at an HTML page, it took note of 
following things:- 
Words occurring in the title, subtitles, meta tags and other 
positions of relative importance were noted for special consideration 
during a subsequent user search. The Google spider was built to index 
every significant word on a page, leaving out the articles “a”, “an” and 
"the”. Other spiders take different approaches. 
 For example, some spiders will keep track of the words in the title, 
sub-headings and links, along with the 100 most frequently used 
words on the page and each word in the first 20 lines of text. Lycos is 
said to use this approach to spidering the Web. 
 GOOGLE built their initial system to use multiple spiders, usually three 
at one time. Each spider could keep about 300 connections to Web 
pages open at a time.
 Google’s spider name is Googlebot. 
 Googlebot is the search bot software used 
by Google, which collects documents from 
the web to build a searchable index for 
the Google Search engine.
 By following the web-pages, INDEX is 
prepared. The index includes text from 
millions of books from several libraries and 
other partners. 
 That means GOOGLE follow links from page 
to page. Also they sort pages by their content 
and other factors.
 These all activities Google carry out is tracked 
in the INDEX. Google continuously updates 
index and it is stored over large servers. 
 Currently, Google’s Index size is over 100 
million Gigabyte.
 Site owners choose whether their sites are 
crawled. 
 To prevent most search engine web 
crawlers from indexing a page on your site, place 
the following meta tag into the<head> section of 
your page: 
<meta name="robots" content="noindex"> 
 To prevent only Google web crawlers from 
indexing a page: 
<meta name="googlebot" content="noindex">
1) AUTOCOMPLETE 
Predicts what you might be searching for. 
This includes understanding terms with more 
than one meaning. 
2) SYNONYMS 
Recognizes words with similar meanings.
3) QUERY UNDERSTANDING 
Gets to the deeper meaning of the words 
you type. 
4) GOOGLE INSTANT 
Displays immediate results as you type. 
5) SPELLING 
Identifies and corrects possible spelling 
errors and provides alternatives.
 Based on all the above factors, Google picks 
some web-pages from the index. 
 Then, Google ranks the result on various 
factors. 
 1) Site & Page Quality:- 
It is checked by how you are writing 
key-words.
2) Freshness:- 
How much fresh the content is & at how 
much regular interval it is updated !! 
3) Safe-Search:- 
Google tries to find out how much it is safe 
and doesn’t contains spams. 
Along with these, there are 200+ factors used 
by Google to rank any particular webs-page.
 After all these operations, you will get the 
desired result and these all happens in one 
nano-second !!!
 Google fights with spam every second to give 
true & relevant result. 
 The majority of spam removal is 
automatic. Google examine other 
questionable documents by hand. If Google 
find spam, they take manual action.
1) PURE SPAM 
Site appears to use aggressive spam 
techniques such as automatically generated 
gibberish, cloaking, scraping content from 
other websites, and/or repeated or egregious 
violations of Google's Webmaster Guidelines. 
2) HIDDEN TEXT AND/OR KEYWORD STUFFING 
Some of the pages may contain hidden 
text and/or keyword stuffing.
3) USER-GENERATED SPAM 
Site appears to contain spammy user-generated 
content. The problematic content 
may appear on forum pages, guestbook pages, 
or user profiles. 
4) PARKED DOMAINS 
Parked domains are placeholder sites with little 
unique content, so Google doesn't typically 
include them in search results.
5) THIN CONTENT WITH LITTLE OR 
NO ADDED VALUE 
Site appears to consist of low-quality or shallow pages 
which do not provide users with much added value 
(such as thin affiliate pages, doorway pages, cookie-cutter 
sites, automatically generated content, or copied 
content). 
6) UNNATURAL LINKS TO A SITE 
Google has detected a pattern of unnatural artificial, 
deceptive or manipulative links pointing to the site. 
These may be the result of buying links that pass 
PageRank or participating in link schemes.
 Besides these all there are thousands other 
factors Google uses to detect Spam and 
decides the page-rank of web-page 
accordingly which is constantly updated and 
finally Google only keeps trusted documents 
in index.
 And the point of Interest is that to make 
presentation on google, I used
 Behind your simple page of results is a 
complex system, carefully crafted and 
tested, to support more than one-hundred 
billion searches each month !!!! 
How Google Search Engine Algorithm Works ??

More Related Content

What's hot

How a search engine works report
How a search engine works reportHow a search engine works report
How a search engine works reportSovan Misra
 
Google ppt by amit
Google ppt by amitGoogle ppt by amit
Google ppt by amitDAVV
 
Entireweb review over 150 million searches per month with website submission ...
Entireweb review over 150 million searches per month with website submission ...Entireweb review over 150 million searches per month with website submission ...
Entireweb review over 150 million searches per month with website submission ...joelmaster
 
Comparing Search Engines
Comparing Search EnginesComparing Search Engines
Comparing Search EnginesMelissa Brisbin
 
Search engine optimization (seo)
Search engine optimization (seo)Search engine optimization (seo)
Search engine optimization (seo)jhon smith
 
Brighton SEO - Site Speed for Content Marketers
Brighton SEO - Site Speed for Content MarketersBrighton SEO - Site Speed for Content Marketers
Brighton SEO - Site Speed for Content MarketersTom Bennet
 
Training Project Report on Search Engines
Training Project Report on Search EnginesTraining Project Report on Search Engines
Training Project Report on Search EnginesShivam Saxena
 
Google search architecture services in Hyderabad
Google search architecture services in HyderabadGoogle search architecture services in Hyderabad
Google search architecture services in HyderabadMartin James
 
Google search techniques
Google search techniquesGoogle search techniques
Google search techniquesNirav Ranpara
 
SEO 101 webinar 10 25-2012
SEO 101 webinar 10 25-2012SEO 101 webinar 10 25-2012
SEO 101 webinar 10 25-2012451 Marketing
 
Week 9 10 ppt-how_searchworks
Week 9 10 ppt-how_searchworksWeek 9 10 ppt-how_searchworks
Week 9 10 ppt-how_searchworkscarolyn oldham
 
What is a canonical tag?
What is a canonical tag?What is a canonical tag?
What is a canonical tag?Abhishek Mitra
 

What's hot (20)

About search engines
About search enginesAbout search engines
About search engines
 
Search engine
Search engineSearch engine
Search engine
 
Search Engines
Search EnginesSearch Engines
Search Engines
 
How a search engine works report
How a search engine works reportHow a search engine works report
How a search engine works report
 
Google ppt by amit
Google ppt by amitGoogle ppt by amit
Google ppt by amit
 
Search engine
Search engineSearch engine
Search engine
 
Entireweb review over 150 million searches per month with website submission ...
Entireweb review over 150 million searches per month with website submission ...Entireweb review over 150 million searches per month with website submission ...
Entireweb review over 150 million searches per month with website submission ...
 
Comparing Search Engines
Comparing Search EnginesComparing Search Engines
Comparing Search Engines
 
Search engine optimization (seo)
Search engine optimization (seo)Search engine optimization (seo)
Search engine optimization (seo)
 
Brighton SEO - Site Speed for Content Marketers
Brighton SEO - Site Speed for Content MarketersBrighton SEO - Site Speed for Content Marketers
Brighton SEO - Site Speed for Content Marketers
 
Training Project Report on Search Engines
Training Project Report on Search EnginesTraining Project Report on Search Engines
Training Project Report on Search Engines
 
Google search architecture services in Hyderabad
Google search architecture services in HyderabadGoogle search architecture services in Hyderabad
Google search architecture services in Hyderabad
 
Google search techniques
Google search techniquesGoogle search techniques
Google search techniques
 
Search Engine Google
Search Engine GoogleSearch Engine Google
Search Engine Google
 
Lvr ppt
Lvr pptLvr ppt
Lvr ppt
 
SEO 101 webinar 10 25-2012
SEO 101 webinar 10 25-2012SEO 101 webinar 10 25-2012
SEO 101 webinar 10 25-2012
 
SEO Animals
SEO AnimalsSEO Animals
SEO Animals
 
Week 9 10 ppt-how_searchworks
Week 9 10 ppt-how_searchworksWeek 9 10 ppt-how_searchworks
Week 9 10 ppt-how_searchworks
 
What is a canonical tag?
What is a canonical tag?What is a canonical tag?
What is a canonical tag?
 
Search Engine
Search EngineSearch Engine
Search Engine
 

Viewers also liked

Clinical Cases from Resource Limited Settings: David Roesel
Clinical Cases from Resource Limited Settings: David RoeselClinical Cases from Resource Limited Settings: David Roesel
Clinical Cases from Resource Limited Settings: David RoeselUWGlobalHealth
 
Understanding search engine algorithms
Understanding search engine algorithmsUnderstanding search engine algorithms
Understanding search engine algorithmsVijay Sankar
 
The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?Kundan Bhaduri
 
Google Search Engine
Google Search EngineGoogle Search Engine
Google Search Engineguestf460ed0
 
Page rank algorithm
Page rank algorithmPage rank algorithm
Page rank algorithmJunghoon Kim
 
Google Page Rank Algorithm
Google Page Rank AlgorithmGoogle Page Rank Algorithm
Google Page Rank AlgorithmOmkar Dash
 
Google Penguin, Google Panda, and Google Algorithms 2013
Google Penguin, Google Panda, and Google Algorithms 2013Google Penguin, Google Panda, and Google Algorithms 2013
Google Penguin, Google Panda, and Google Algorithms 2013Bill Hartzer
 
Google hummingbird algorithm ppt
Google hummingbird algorithm pptGoogle hummingbird algorithm ppt
Google hummingbird algorithm pptPriyodarshini Dhar
 
Pagerank Algorithm Explained
Pagerank Algorithm ExplainedPagerank Algorithm Explained
Pagerank Algorithm Explainedjdhaar
 

Viewers also liked (12)

Clinical Cases from Resource Limited Settings: David Roesel
Clinical Cases from Resource Limited Settings: David RoeselClinical Cases from Resource Limited Settings: David Roesel
Clinical Cases from Resource Limited Settings: David Roesel
 
Google algorithim’s
Google  algorithim’sGoogle  algorithim’s
Google algorithim’s
 
Understanding search engine algorithms
Understanding search engine algorithmsUnderstanding search engine algorithms
Understanding search engine algorithms
 
PageRank
PageRankPageRank
PageRank
 
The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?
 
Google Search Engine
Google Search EngineGoogle Search Engine
Google Search Engine
 
Page rank algorithm
Page rank algorithmPage rank algorithm
Page rank algorithm
 
Google PageRank
Google PageRankGoogle PageRank
Google PageRank
 
Google Page Rank Algorithm
Google Page Rank AlgorithmGoogle Page Rank Algorithm
Google Page Rank Algorithm
 
Google Penguin, Google Panda, and Google Algorithms 2013
Google Penguin, Google Panda, and Google Algorithms 2013Google Penguin, Google Panda, and Google Algorithms 2013
Google Penguin, Google Panda, and Google Algorithms 2013
 
Google hummingbird algorithm ppt
Google hummingbird algorithm pptGoogle hummingbird algorithm ppt
Google hummingbird algorithm ppt
 
Pagerank Algorithm Explained
Pagerank Algorithm ExplainedPagerank Algorithm Explained
Pagerank Algorithm Explained
 

Similar to How Google Search Engine Algorithm Works ??

Search Engine Optimization (Seo)
Search Engine Optimization (Seo)Search Engine Optimization (Seo)
Search Engine Optimization (Seo)ssunnysengar
 
Google Search Engine
Google Search Engine Google Search Engine
Google Search Engine Aniket_1415
 
Search Engine Optimization
Search Engine OptimizationSearch Engine Optimization
Search Engine Optimizationshrishail uttagi
 
Search Engine Optimization - Fundamentals - SEO
Search Engine Optimization - Fundamentals - SEOSearch Engine Optimization - Fundamentals - SEO
Search Engine Optimization - Fundamentals - SEONeeraj Reddy
 
Demand Quest SEO Training - Session 2
Demand Quest SEO Training - Session 2Demand Quest SEO Training - Session 2
Demand Quest SEO Training - Session 2Nate Plaunt
 
The Anatomy of GOOGLE Search Engine
The Anatomy of GOOGLE Search EngineThe Anatomy of GOOGLE Search Engine
The Anatomy of GOOGLE Search EngineManish Chopra
 
Search engine and web crawler
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawlerishmecse13
 
Effective Searching Policies for Web Crawler
Effective Searching Policies for Web CrawlerEffective Searching Policies for Web Crawler
Effective Searching Policies for Web CrawlerIJMER
 
Lost in the Net: Navigating Search Engines
Lost in the Net:  Navigating Search EnginesLost in the Net:  Navigating Search Engines
Lost in the Net: Navigating Search EnginesJohan Koren
 
How google works and functions: A complete Approach
How google works and functions: A complete ApproachHow google works and functions: A complete Approach
How google works and functions: A complete ApproachPrakhar Gethe
 
Demand Quest SEO training session 2
Demand Quest SEO training session 2Demand Quest SEO training session 2
Demand Quest SEO training session 2Nate Plaunt
 
Latest Updates on SEO
Latest Updates on SEOLatest Updates on SEO
Latest Updates on SEOshailaja100
 
Search Engine Optimisation - MA Journalism - Week Three
Search Engine Optimisation - MA Journalism - Week ThreeSearch Engine Optimisation - MA Journalism - Week Three
Search Engine Optimisation - MA Journalism - Week Threepaulwould
 
Basic SEO Techniques All Webmasters Must Know
Basic SEO Techniques All Webmasters Must KnowBasic SEO Techniques All Webmasters Must Know
Basic SEO Techniques All Webmasters Must Knowwaqas ahmad
 
Il processo di Crawilng e Indexing di Google - Paolo Ramazzotti
Il processo di Crawilng e Indexing di Google - Paolo RamazzottiIl processo di Crawilng e Indexing di Google - Paolo Ramazzotti
Il processo di Crawilng e Indexing di Google - Paolo RamazzottiPaolo Ramazzotti
 
Crawling, Indicizzazione e SEO - Paolo Ramazzotti
Crawling, Indicizzazione e SEO - Paolo RamazzottiCrawling, Indicizzazione e SEO - Paolo Ramazzotti
Crawling, Indicizzazione e SEO - Paolo RamazzottiGimasi Sa
 
The ultimate guide to the invisible web
The ultimate guide to the invisible webThe ultimate guide to the invisible web
The ultimate guide to the invisible webYKNIB O
 

Similar to How Google Search Engine Algorithm Works ?? (20)

Search Engine Optimization (Seo)
Search Engine Optimization (Seo)Search Engine Optimization (Seo)
Search Engine Optimization (Seo)
 
Google Search Engine
Google Search Engine Google Search Engine
Google Search Engine
 
Search Engine Optimization
Search Engine OptimizationSearch Engine Optimization
Search Engine Optimization
 
Seo Manual
Seo ManualSeo Manual
Seo Manual
 
Search Engine Optimization - Fundamentals - SEO
Search Engine Optimization - Fundamentals - SEOSearch Engine Optimization - Fundamentals - SEO
Search Engine Optimization - Fundamentals - SEO
 
Demand Quest SEO Training - Session 2
Demand Quest SEO Training - Session 2Demand Quest SEO Training - Session 2
Demand Quest SEO Training - Session 2
 
The Anatomy of GOOGLE Search Engine
The Anatomy of GOOGLE Search EngineThe Anatomy of GOOGLE Search Engine
The Anatomy of GOOGLE Search Engine
 
Search engine and web crawler
Search engine and web crawlerSearch engine and web crawler
Search engine and web crawler
 
Effective Searching Policies for Web Crawler
Effective Searching Policies for Web CrawlerEffective Searching Policies for Web Crawler
Effective Searching Policies for Web Crawler
 
Lost in the Net: Navigating Search Engines
Lost in the Net:  Navigating Search EnginesLost in the Net:  Navigating Search Engines
Lost in the Net: Navigating Search Engines
 
Search engine
Search engineSearch engine
Search engine
 
How google works and functions: A complete Approach
How google works and functions: A complete ApproachHow google works and functions: A complete Approach
How google works and functions: A complete Approach
 
Demand Quest SEO training session 2
Demand Quest SEO training session 2Demand Quest SEO training session 2
Demand Quest SEO training session 2
 
Latest Updates on SEO
Latest Updates on SEOLatest Updates on SEO
Latest Updates on SEO
 
Search Engine Optimisation - MA Journalism - Week Three
Search Engine Optimisation - MA Journalism - Week ThreeSearch Engine Optimisation - MA Journalism - Week Three
Search Engine Optimisation - MA Journalism - Week Three
 
Basic SEO Techniques All Webmasters Must Know
Basic SEO Techniques All Webmasters Must KnowBasic SEO Techniques All Webmasters Must Know
Basic SEO Techniques All Webmasters Must Know
 
Search engine
Search engineSearch engine
Search engine
 
Il processo di Crawilng e Indexing di Google - Paolo Ramazzotti
Il processo di Crawilng e Indexing di Google - Paolo RamazzottiIl processo di Crawilng e Indexing di Google - Paolo Ramazzotti
Il processo di Crawilng e Indexing di Google - Paolo Ramazzotti
 
Crawling, Indicizzazione e SEO - Paolo Ramazzotti
Crawling, Indicizzazione e SEO - Paolo RamazzottiCrawling, Indicizzazione e SEO - Paolo Ramazzotti
Crawling, Indicizzazione e SEO - Paolo Ramazzotti
 
The ultimate guide to the invisible web
The ultimate guide to the invisible webThe ultimate guide to the invisible web
The ultimate guide to the invisible web
 

Recently uploaded

Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxJiesonDelaCerna
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxEyham Joco
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...jaredbarbolino94
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Recently uploaded (20)

Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptx
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptx
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developer
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 

How Google Search Engine Algorithm Works ??

  • 1. How Google search engine algorithm works Prepared by:- Viral Shah (120570107014) Guided by :- Prof. Sahista Machhar, MEFGI
  • 2. It is a program that searches for and identifies items in a database that correspond to keywords or characters specified by the user, used especially for finding particular sites on the World Wide Web.
  • 3.  There are 759 Million websites on the Web & 60 Trillion webpages of this websites.  AND IT’S CONSTANTLY GROWING !!!!!
  • 4.  GOOGLE navigates WEB by crawling.  To find information on the hundreds of millions of Web pages that exist, a search engine employs special software robots, called SPIDERS, to build lists of the words found on Web sites. When a spider is building its lists, the process is called Web crawling.
  • 5.  The usual starting points are lists of heavily used servers and very popular pages. The spider will begin with a popular site, indexing the words on its pages and following every link found within the site. In this way, the spidering system quickly begins to travel, spreading out across the most widely used portions of the Web.
  • 6.  When the Google spider looked at an HTML page, it took note of following things:- Words occurring in the title, subtitles, meta tags and other positions of relative importance were noted for special consideration during a subsequent user search. The Google spider was built to index every significant word on a page, leaving out the articles “a”, “an” and "the”. Other spiders take different approaches.  For example, some spiders will keep track of the words in the title, sub-headings and links, along with the 100 most frequently used words on the page and each word in the first 20 lines of text. Lycos is said to use this approach to spidering the Web.  GOOGLE built their initial system to use multiple spiders, usually three at one time. Each spider could keep about 300 connections to Web pages open at a time.
  • 7.  Google’s spider name is Googlebot.  Googlebot is the search bot software used by Google, which collects documents from the web to build a searchable index for the Google Search engine.
  • 8.  By following the web-pages, INDEX is prepared. The index includes text from millions of books from several libraries and other partners.  That means GOOGLE follow links from page to page. Also they sort pages by their content and other factors.
  • 9.  These all activities Google carry out is tracked in the INDEX. Google continuously updates index and it is stored over large servers.  Currently, Google’s Index size is over 100 million Gigabyte.
  • 10.  Site owners choose whether their sites are crawled.  To prevent most search engine web crawlers from indexing a page on your site, place the following meta tag into the<head> section of your page: <meta name="robots" content="noindex">  To prevent only Google web crawlers from indexing a page: <meta name="googlebot" content="noindex">
  • 11. 1) AUTOCOMPLETE Predicts what you might be searching for. This includes understanding terms with more than one meaning. 2) SYNONYMS Recognizes words with similar meanings.
  • 12. 3) QUERY UNDERSTANDING Gets to the deeper meaning of the words you type. 4) GOOGLE INSTANT Displays immediate results as you type. 5) SPELLING Identifies and corrects possible spelling errors and provides alternatives.
  • 13.  Based on all the above factors, Google picks some web-pages from the index.  Then, Google ranks the result on various factors.  1) Site & Page Quality:- It is checked by how you are writing key-words.
  • 14. 2) Freshness:- How much fresh the content is & at how much regular interval it is updated !! 3) Safe-Search:- Google tries to find out how much it is safe and doesn’t contains spams. Along with these, there are 200+ factors used by Google to rank any particular webs-page.
  • 15.  After all these operations, you will get the desired result and these all happens in one nano-second !!!
  • 16.  Google fights with spam every second to give true & relevant result.  The majority of spam removal is automatic. Google examine other questionable documents by hand. If Google find spam, they take manual action.
  • 17. 1) PURE SPAM Site appears to use aggressive spam techniques such as automatically generated gibberish, cloaking, scraping content from other websites, and/or repeated or egregious violations of Google's Webmaster Guidelines. 2) HIDDEN TEXT AND/OR KEYWORD STUFFING Some of the pages may contain hidden text and/or keyword stuffing.
  • 18. 3) USER-GENERATED SPAM Site appears to contain spammy user-generated content. The problematic content may appear on forum pages, guestbook pages, or user profiles. 4) PARKED DOMAINS Parked domains are placeholder sites with little unique content, so Google doesn't typically include them in search results.
  • 19. 5) THIN CONTENT WITH LITTLE OR NO ADDED VALUE Site appears to consist of low-quality or shallow pages which do not provide users with much added value (such as thin affiliate pages, doorway pages, cookie-cutter sites, automatically generated content, or copied content). 6) UNNATURAL LINKS TO A SITE Google has detected a pattern of unnatural artificial, deceptive or manipulative links pointing to the site. These may be the result of buying links that pass PageRank or participating in link schemes.
  • 20.  Besides these all there are thousands other factors Google uses to detect Spam and decides the page-rank of web-page accordingly which is constantly updated and finally Google only keeps trusted documents in index.
  • 21.  And the point of Interest is that to make presentation on google, I used
  • 22.  Behind your simple page of results is a complex system, carefully crafted and tested, to support more than one-hundred billion searches each month !!!! 