Your SlideShare is downloading. ×
0
Introduction to Search Engines
Introduction to Search Engines
Introduction to Search Engines
Introduction to Search Engines
Introduction to Search Engines
Introduction to Search Engines
Introduction to Search Engines
Introduction to Search Engines
Introduction to Search Engines
Introduction to Search Engines
Introduction to Search Engines
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Introduction to Search Engines

771

Published on

Gives a brief introduction on how a search engine works

Gives a brief introduction on how a search engine works

0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
771
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
69
Comments
0
Likes
1
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Describes how a basic search engine works.<br />How a Search Engine Works<br />Reehaz Soobhany (0920302)<br />Strategic e-Marketing<br />University of Mauritius 2010<br />
  • 2. Search Engines Introduction<br />Everyone who uses the internet today surely uses a search engine.<br />Several types of search engines<br />Crawler Based (Google, Yahoo)<br />Human Directories (Open Directory, Yahoo!Directory)<br />Hybrid<br />Meta Search Engine (Ask.com)<br />
  • 3. Crawler Based Search Engine<br />Core Operations:<br />Web Crawling (aka the spider) – follows every link in a page recursively and downloads the page<br />Indexing – Creates the inverted file<br />Searching – Searches through the inverted (indexed file according to the query of the user<br />
  • 4. Indexing<br />Normalize Documents<br />Deletes stop words<br />Stem words<br />Create index entries<br />Calculate weights<br />Updates inverted file<br />
  • 5. Document Normalization<br />&lt;H1&gt;<br />This is a Heading Level One<br />&lt;/H1&gt;<br />Case Folding<br />&lt;h1&gt;<br />this is a heading level one<br />&lt;/h1&gt;<br />Extract Core document text from file<br />this is a heading level one<br />
  • 6. Delete Stop Words<br />Stop words are words which do not have little value is finding a relevant document. Example of stop words are :<br />A, are, is, when, how…<br />Helps save resources and also not create to big and irrelevant indexes<br />heading level one<br />
  • 7. Word Stemming &amp; Index Entries<br />Word stemming removes the suffixes from words<br />Add efficiency to the index file<br />Also match the meaning rather than the exact word<br />inflectional suffixes (-s, -es, -ed)<br />derivational suffixes (-ing, -able, -aciousness, -ability)<br />headlevelone<br />
  • 8. Calculate Weights<br />Usually a secret algorithm of the search engine<br />Some typical scheme used:<br />Placement in a document (a word in a heading level 1 will have a greater weight than one at heading level 2 or a normal text)<br />The number of other documents which refers to this document<br />If by authoritative writing<br />
  • 9. Creates or Update the Inverted File<br />
  • 10. Query Processor<br />When the user type a query in the search engine, the search engine recognises the terms and operators<br />Runs the query against the inverted file<br />Ranks the result. Again the secret algorithm of the search engine. Uses the weights on each word<br />Return the results to the user.<br />Voila <br />
  • 11. Thank You<br />

×