Search Engines
2
What Are They?
 Four Components
 A database of references to webpages
 An indexing robot that crawls the WWW
 An interface
 Enables users to submit queries
 Displays results
 Information retrieval system
 Each engine is unique, but they all work in mostly the same way
3
Database
 Where user's query is matched
 Contains only essential parts of pages
 Only includes pages that were indexed
 The database always lags behind the live web, so search engines are always somewhat out of date
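A minimal sketch of such a database, assuming a tiny in-memory collection. The page URLs and the `build_index` helper are hypothetical; a real engine stores far more than a word-to-page mapping.

```python
from collections import defaultdict
import re

def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_index(pages):
    """Map each word to the set of page URLs that contain it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in tokenize(text):
            index[word].add(url)
    return index

# Hypothetical pages standing in for what a crawler collected.
pages = {
    "http://example.com/a": "Search engines crawl the web",
    "http://example.com/b": "A web crawler follows links",
}
index = build_index(pages)

# Only indexed pages can ever match a query.
print(sorted(index["web"]))
```

A page that was never crawled has no entry here, which is why it cannot appear in any result.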
4
Web Crawler
 A robot that follows links
 Records data it finds
 Words in the webpage
 Metadata
 ALT attributes in IMG tags
 Obeys the Robot Exclusion Protocol (robots.txt)
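A minimal sketch of how a polite crawler might consult the Robot Exclusion Protocol before fetching a page, using Python's standard `urllib.robotparser`. The robots.txt content, the `ExampleBot` agent name, and the URLs are assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt; a real crawler would download it from the site.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

def allowed(url, agent="ExampleBot"):
    """Return True if the rules permit this agent to fetch the URL."""
    return parser.can_fetch(agent, url)

print(allowed("http://example.com/index.html"))          # True
print(allowed("http://example.com/private/notes.html"))  # False
```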
5
Search Engine Interfaces
 Gathers input from users
 Presents results from the IR system
 Often in ranked order
6
Search Engine Interfaces
 Input
 User requirements
 Search expression, search limits
 Presentation style
 Presentation format, search type
7
Search Engine Interfaces
 Output
 Results
 Descriptions
 Clusters
8
Search Term Matching
 Trying to find a match in the database
 Two main methods
 Keyword searching
 Matching single terms, computing cosine similarity
 Concept-based searching
 Examining clusters of words
 Attempts to determine the meaning of the query and find records related to that meaning
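The cosine computation used in keyword searching can be sketched as follows, assuming plain term-frequency vectors and whitespace tokenization. The `cosine` helper and the sample texts are illustrative only; real engines usually weight terms as well.

```python
import math
from collections import Counter

def cosine(query, document):
    """Cosine of the angle between two term-frequency vectors."""
    q = Counter(query.lower().split())
    d = Counter(document.lower().split())
    dot = sum(q[term] * d[term] for term in q)
    norm_q = math.sqrt(sum(count * count for count in q.values()))
    norm_d = math.sqrt(sum(count * count for count in d.values()))
    if norm_q == 0 or norm_d == 0:
        return 0.0  # an empty text matches nothing
    return dot / (norm_q * norm_d)

query = "web crawler"
print(cosine(query, "a web crawler follows links on the web"))
print(cosine(query, "stemming and stop words"))  # 0.0, no shared terms
```

Scores fall between 0 (no shared terms) and 1 (identical term proportions), so documents can be ranked by this value.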
9
Basic IR Features
 Boolean operators
 AND, OR, NOT, grouping
 Extended operators
 NEAR, ADJACENT, phrase quotes (" ")
 Stop word deletion
 Stemming
 Searching in fields (e.g. host)
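A minimal sketch of three of these features working together: stop word deletion, stemming, and Boolean operators. The stop list, the crude `stem` function (a stand-in for a real stemmer such as Porter's), and the three sample documents are all assumptions.

```python
import re

STOP_WORDS = {"a", "an", "the", "of", "in", "and", "or"}

def stem(word):
    """Crude suffix stripping; not a real stemming algorithm."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

def terms(text):
    """Tokenize, drop stop words, and stem what remains."""
    words = re.findall(r"[a-z]+", text.lower())
    return {stem(w) for w in words if w not in STOP_WORDS}

docs = {
    1: "Crawling the links of the web",
    2: "Search engines and ranking",
    3: "Web search engines crawl links",
}

# Inverted index from stemmed term to document ids.
index = {}
for doc_id, text in docs.items():
    for term in terms(text):
        index.setdefault(term, set()).add(doc_id)

def lookup(word):
    return index.get(stem(word.lower()), set())

# web AND (crawl OR ranking) NOT search
result = (lookup("web") & (lookup("crawl") | lookup("ranking"))) - lookup("search")
print(result)  # {1}
```

Boolean AND, OR, and NOT map directly onto set intersection, union, and difference; stemming lets the query word "crawl" match "Crawling" in document 1.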
10
Ranked Output
 Most SEs produce ranked lists by applying
simple rules:
 Early words are more important
 Title is very important
 Frequency of occurrence matters for some
 Infrequent words matter more
 Modification date
 Google is different:
 PageRank™ method based on popularity
 Links as money
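The "links as money" idea can be sketched as a simplified PageRank iteration: each page splits its current score evenly among the pages it links to. This is a textbook-style approximation, not Google's production method; the damping factor of 0.85, the iteration count, and the four-page link graph are assumptions.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank over a dict of page -> list of outgoing links.

    Every link target must also appear as a key in `links`.
    """
    pages = list(links)
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page, outlinks in links.items():
            if outlinks:
                # A page's score is divided among its outgoing links.
                share = damping * rank[page] / len(outlinks)
                for target in outlinks:
                    new_rank[target] += share
            else:
                # A page with no links spreads its score over all pages.
                for target in pages:
                    new_rank[target] += damping * rank[page] / n
        rank = new_rank
    return rank

# Hypothetical link graph: "c" is linked to by every other page.
links = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(links)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

Page "c" ends up on top because it collects score from three pages, while "d", which nobody links to, keeps only the small baseline.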
11
Googlebombing
 Google spoofed from the lecture list
 first hit from 1992
 Official GoogleBlog explanation
12
What about the Invisible Web?
 Also known as the Deep Web
 Documents that are on the WWW but
not indexed by Search Engines
 Some are available only by submitting
forms
 Some are not generally accessible (in
subnets)
 Some are not in (X)HTML format
13
The Invisible Web Isn't So
Invisible Anymore…
 More search engines parse non-(X)HTML formats now than before
 Because of awareness of the problem
companies are making more content
available using
 Stable URLs
 Robot-friendly sitemaps
 But much content is still not indexed
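A minimal sketch of generating a robot-friendly sitemap in the sitemaps.org XML format with Python's standard library. The `build_sitemap` helper and the example URLs are hypothetical; optional elements such as modification dates are left out.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"

def build_sitemap(urls):
    """Return sitemap XML listing each stable URL in a <loc> element."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
    return ET.tostring(urlset, encoding="unicode")

sitemap_xml = build_sitemap([
    "http://example.com/",
    "http://example.com/articles/search-engines",
])
print(sitemap_xml)
```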
14
But, there's still plenty of
important yet invisible docs
 How to find them?
 Many of them are in databases
 No one search engine covers everything
 Use database tools from the university's library
 Especially for research articles
 Use multiple search engines or a meta-crawler
 Dogpile is a well-known example
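A minimal sketch of what a meta-crawler does with the ranked lists it gets back from several engines: merge them, drop duplicates, and re-rank. Scoring each URL by the sum of its reciprocal ranks is one simple choice made here for illustration, not a description of how any particular meta-crawler works; the engine result lists are hypothetical.

```python
from collections import defaultdict

def merge_results(result_lists):
    """Merge ranked URL lists; a URL scores 1/position in each list."""
    scores = defaultdict(float)
    for results in result_lists:
        for position, url in enumerate(results, start=1):
            scores[url] += 1.0 / position
    # Highest score first; ties broken alphabetically for stable output.
    return sorted(scores, key=lambda url: (-scores[url], url))

# Hypothetical results from two engines for the same query.
engine_a = ["http://example.com/x", "http://example.com/y"]
engine_b = ["http://example.com/y", "http://example.com/z"]

merged = merge_results([engine_a, engine_b])
print(merged)
```

The page found by both engines rises to the top, and the pages each engine found alone are still included, which is the coverage benefit of querying more than one engine.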
Search Engines
A Summary of Practical Advice
16
How To Succeed With SEs
 As a surfer:
 If you don't know what you are looking for
 Use multiple SEs, or a meta-crawler
 Search within results
 If you know what you are looking for
 Use multiple SEs, or a meta-crawler
 Use Boolean expressions or search within
results
 Consider specialized engines
17
How To Succeed With SEs
 As a creator:
 HTML level
 Always use ALT attributes with <IMG>, etc.
 Avoid frames
 Make it easier to index
 Don't expect SEs to find your pages
 Make links between your pages
 Use metadata
 Informal: <meta name="description" …>
 Formal: Dublin core and others
 Increase your pages' popularity
 Don’t use systematic reciprocal linking: rings, exchanges, lists
 The PageRank™ a page passes to each page it links to is inversely proportional to its outdegree
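A minimal sketch of checking a page for the HTML-level advice above: it extracts the `<meta name="description">` content and counts `<img>` tags without ALT text, roughly as an indexing robot might see them. The `PageSummary` class and the sample HTML are hypothetical.

```python
from html.parser import HTMLParser

class PageSummary(HTMLParser):
    """Collects the meta description and ALT text found in a page."""

    def __init__(self):
        super().__init__()
        self.description = None
        self.alt_texts = []
        self.images_missing_alt = 0

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        name = (attributes.get("name") or "").lower()
        if tag == "meta" and name == "description":
            self.description = attributes.get("content")
        elif tag == "img":
            alt = attributes.get("alt")
            if alt:
                self.alt_texts.append(alt)
            else:
                self.images_missing_alt += 1

# Hypothetical page: one image has ALT text, one does not.
html = """
<html><head>
<meta name="description" content="Lecture notes on search engines">
</head><body>
<img src="logo.png" alt="Course logo">
<img src="spacer.gif">
</body></html>
"""

summary = PageSummary()
summary.feed(html)
print(summary.description)
print(summary.alt_texts, summary.images_missing_alt)
```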
18
How To Succeed With SEs
 As a creator (cont.)
 For surfers:
 Use <meta name="description" …>
 Don't expect surfers to start at top of your
hierarchy
 Don't rely on a hierarchy
 Include a context map near the top of each page
 Don't use frames
 Think through dynamic content implications
 Stickiness… is for another day
