Upcoming SlideShare
×

1,849 views
1,715 views

Published on

Published in: Education, Technology, Design
1 Like
Statistics
Notes
• Full Name
Comment goes here.

Are you sure you want to Yes No

Are you sure you want to  Yes  No
• sir would u mind describing d things archie,gopher,jughead,wandex in breif

Are you sure you want to  Yes  No
Views
Total views
1,849
On SlideShare
0
From Embeds
0
Number of Embeds
26
Actions
Shares
0
148
2
Likes
1
Embeds 0
No embeds

No notes for slide

1. 1. RUCHIR KUMAR GUPTA BTECH FINAL YEAR(I.T.) 0701013084
2. 2. CONTENTS INTRODUCTION HISTORY WORKING TYPES OF SERCH ENGINE ADVANTAGES CONCLUSION
3. 3. INTRODUCTION Search engine is a software program that searches for sites based on the words that you designate as search terms. Search engines look through their own databases of information in order to find what it is that you are looking for. “Search engine” is the popular term for an Information Retrieval (IR) system.
4. 4. HISTORY Archie – First search tool for the Internet Gopher – indexed plain text documents Jughead – searched the files stored in Gopher index system. Wandex – first Web search engine
5. 5. How do Search Engine Works  Spiders  Robots
6. 6. FORMULA Larry Page and Sergin Brin at Stanford University gave it.  PA=(1-d)+d (PR(T1))/C (T1)+…+PR (Tn)/C (Tn) Where  PR(A): Page Rank of page A  PR(Ti): Page Rank of pages Ti which link to page A.  C(Ti): the no of outbound link on page Ti d: damping factor in the range of 0<d<1,usually set to 0.85. The highest the PR, the better the inbound link, so it would seem.
7. 7. APPROXIMATING RELEVANCE  Statistical similarity is used to estimate the relevance of an query to answer.  Consider the query MAHENDRA SINGH DHONI 1) A good answer contains all three words,and more frequently the better,we called this term frequency (TF). 2) Some query terms are are more important-have the discriminatory power-better than others.
8. 8. Resolving a Query Consider ( cat hat mat ) Select a word from query ( “cat” ) Retrieve the list for the word cat Find the best ranked document and look up the mapping table. Retrieve and Summarize the docs.
9. 9. Search Engine Modules : A document processor A query processor A search and matching function A ranking capability  Summarizing and Presenting documents.
10. 10. TYPES OF SEARCH ENGINE Crawler-Based Search Engines Human-Powered Directories Hybrid Search Engines" Or Mixed Results
11. 11. Google use spiders Large index of keywords. Google’s PAGE RANK . 1.frequency and location of keywords within the Web page 2. Web page history. 3. number of other Web pages that link to the page in question
12. 12. ADVANTAGES  It organizes the world ‘s information.  It makes the information universally accessible.  It makes the information useful.  Helps people to interact one another.  Helps to know about the products and goods.  Helps the people to know about the society.  Any information can be gathered from google.
13. 13. CONCLUSION Search engine plays important role in accessing the content over the internet, it fetches the pages requested by the user.  It made the internet and accessing the information just a click away. The need for better search engines only increases The search engine sites are among the most popular websites.