SlideShare a Scribd company logo
1 of 3
How different search engines work


Web Crawlers:
  •   A Web crawler is a computer program that browses the World
      Wide Web in a methodical, automated manner or in an orderly
      fashion. Web crawlers are mainly used to create a copy of all
      the visited pages for later processing by a search engine that
      will index the downloaded pages to provide fast searches.
      Crawlers can also be used for automating maintenance tasks on
      a Web site, such as checking links or validating HTML code.
      Also, crawlers can be used to gather specific types of
      information from Web pages, such as harvesting e-mail
      addresses, usually for spam.

Standard Diagram of Web Crawler
Examples of Web Crawler:

  •   Yahoo! Slurp
  •   Msnbot
  •   FAST Crawler
  •   Googlebot
  •   Methabot
  •   PolyBot
  •   RBSE Web.
  •   WebCrawler
  •   Web Fountain
  •   WebRACE




  Web Directory:


  A web directory or link directory is a directory on the World Wide
  Web. It specializes in linking to other web sites and categorizing
  those links. A web directory is not a search engine and does not
  display lists of web pages based on keywords; instead, it lists web
  sites by category and subcategory.

  Examples of well known web directories are Yahoo! Directory and
  the Open Directory Project (ODP). ODP is significant due to its
  extensive categorization and large number of listings and its free
  availability for use by other directories and search engines.

  Web Directories Features:

  •   Free submission – there is no charge for the review and listing
      of the site
  •   Reciprocal link – a link back to the directory must be added
      somewhere on the submitted site in order to get listed in the
      directory
  •   Paid submission – a one-time or recurring fee is charged for
      reviewing/listing the submitted link
•   No follow – there is a rel="nofollow" attribute associated
    with the link, meaning search engines will give no weight to the
    link.
•   Featured listing – the link is given a premium position in a
    category (or multiple categories) or other sections of the
    directory, such as the homepage. Sometimes called sponsored
    listing.
•   Bid for position – where sites are ordered based on bids
•   Affiliate links – where the directory earns commission for
    referred customers from the listed websites

More Related Content

More from Stuart Bathurst RC High School (12)

Ao5 evidence of charts and graphs
Ao5 evidence of charts and graphsAo5 evidence of charts and graphs
Ao5 evidence of charts and graphs
 
Ao4 collected data spreadsheet
Ao4 collected data spreadsheetAo4 collected data spreadsheet
Ao4 collected data spreadsheet
 
Ao4 evidence for my spreadsheet
Ao4 evidence for my spreadsheetAo4 evidence for my spreadsheet
Ao4 evidence for my spreadsheet
 
Ao3 collected data spreadsheet template
Ao3 collected data spreadsheet templateAo3 collected data spreadsheet template
Ao3 collected data spreadsheet template
 
Ao2 data collection and sampling
Ao2 data collection and samplingAo2 data collection and sampling
Ao2 data collection and sampling
 
Ao2 computer and internet access questionnaire
Ao2 computer and internet access questionnaireAo2 computer and internet access questionnaire
Ao2 computer and internet access questionnaire
 
Ao1 collecting background information
Ao1 collecting background informationAo1 collecting background information
Ao1 collecting background information
 
Ao6 Presentation of findings
Ao6 Presentation of  findingsAo6 Presentation of  findings
Ao6 Presentation of findings
 
Unit 3 -_problem_solving
Unit 3 -_problem_solvingUnit 3 -_problem_solving
Unit 3 -_problem_solving
 
Sam magazine
Sam magazineSam magazine
Sam magazine
 
Unit 2 team plan
Unit 2 team planUnit 2 team plan
Unit 2 team plan
 
Task 7
Task 7Task 7
Task 7
 

How different search engines work

  • 1. How different search engines work Web Crawlers: • A Web crawler is a computer program that browses the World Wide Web in a methodical, automated manner or in an orderly fashion. Web crawlers are mainly used to create a copy of all the visited pages for later processing by a search engine that will index the downloaded pages to provide fast searches. Crawlers can also be used for automating maintenance tasks on a Web site, such as checking links or validating HTML code. Also, crawlers can be used to gather specific types of information from Web pages, such as harvesting e-mail addresses, usually for spam. Standard Diagram of Web Crawler
  • 2. Examples of Web Crawler: • Yahoo! Slurp • Msnbot • FAST Crawler • Googlebot • Methabot • PolyBot • RBSE Web. • WebCrawler • Web Fountain • WebRACE Web Directory: A web directory or link directory is a directory on the World Wide Web. It specializes in linking to other web sites and categorizing those links. A web directory is not a search engine and does not display lists of web pages based on keywords; instead, it lists web sites by category and subcategory. Examples of well known web directories are Yahoo! Directory and the Open Directory Project (ODP). ODP is significant due to its extensive categorization and large number of listings and its free availability for use by other directories and search engines. Web Directories Features: • Free submission – there is no charge for the review and listing of the site • Reciprocal link – a link back to the directory must be added somewhere on the submitted site in order to get listed in the directory • Paid submission – a one-time or recurring fee is charged for reviewing/listing the submitted link
  • 3. No follow – there is a rel="nofollow" attribute associated with the link, meaning search engines will give no weight to the link. • Featured listing – the link is given a premium position in a category (or multiple categories) or other sections of the directory, such as the homepage. Sometimes called sponsored listing. • Bid for position – where sites are ordered based on bids • Affiliate links – where the directory earns commission for referred customers from the listed websites