Week12presentation

How Internet Search
Engines Work

[Group H] member
s1160001 Takatoshi Akiyama
s1160006 Taishi Abe
s1160007 Yuki Abe

Crawler

A web crawler (also known as a web spider
or web robot) is a program or automated
script which browses the World Wide Web
in a methodical, automated manner.
Other less frequently used names for web
crawlers are ants, automatic indexers, bots,
and worms

Spider
A program that automatically fetches Web pages.
Spiders are used to feed pages to search engines.
It's called a spider because it crawls over the Web.
Another term for these programs is Web crawler.

Because most Web pages contain links to other
pages, a spider can start almost anywhere. As
soon as it see a link to another page, it goes off
and fetches it. Large search engines.

Search Engine

Search Engine means the Web site that can search
information shown on the Internet with keyword. It
can classify to two kinds of the directory type
classified according to full text search type and a
category to search by keyword.

Indexing Software

Researchers have been trying for many years to
develop linguistic processing systems to analyze
text and automatically produce an index of the
contents.
Some progress has begun to be made for
specialist technical documents, written in very
formal language, but for normal text results are still
haphazard at best and so much and so cannot be
recommended for professional publishing.

References

>Spider
http://www.webopedia.com/TERM/s/spide
r.html
>society of indexers
http://www.indexers.org.uk/index.php?id
=211
>crawler
http://en.wikipedia.org/wiki/Web_crawler

Week12presentation

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (15)

Similar to Week12presentation

Similar to Week12presentation (20)

Recently uploaded

Recently uploaded (20)

Week12presentation