1. How Internet Search
Engines Work
[Group H] member
s1160001 Takatoshi Akiyama
s1160006 Taishi Abe
s1160007 Yuki Abe
2. Crawler
A web crawler (also known as a web spider
or web robot) is a program or automated
script which browses the World Wide Web
in a methodical, automated manner.
Other less frequently used names for web
crawlers are ants, automatic indexers, bots,
and worms
3. Spider
A program that automatically fetches Web pages.
Spiders are used to feed pages to search engines.
It's called a spider because it crawls over the Web.
Another term for these programs is Web crawler.
Because most Web pages contain links to other
pages, a spider can start almost anywhere. As
soon as it see a link to another page, it goes off
and fetches it. Large search engines.
4. Search Engine
Search Engine means the Web site that can search
information shown on the Internet with keyword. It
can classify to two kinds of the directory type
classified according to full text search type and a
category to search by keyword.
5. Indexing Software
Researchers have been trying for many years to
develop linguistic processing systems to analyze
text and automatically produce an index of the
contents.
Some progress has begun to be made for
specialist technical documents, written in very
formal language, but for normal text results are still
haphazard at best and so much and so cannot be
recommended for professional publishing.