Slideshow transcript
Slide 1: CHAPTER 4 Search Engines: The Basics
Slide 2: 1 Search engine overview 2 Searching options 3 Specialty search engines 4 Meta search engines
Slide 3: 5 Search engine shortcuts 6 Mashups 7 Desktop search programs 8 Keeping up-to-date
Slide 4: Search engine and web directories • Larger • No intervention • Searching not browsing
Slide 5: What is a search engine? • Service on the web that allows searching of a large database of web pages by word, phrase, and other criteria • May provide other services such as translation, shopping, etc. • Any site containing a search box could be considered to have a search engine
Slide 6: How search engines are created • Four major steps are involved: – Spiders (crawlers) – Indexing program & index – Search engine program – HTML user interface
Slide 7: Spiders (crawlers) • Used by search engines to: – Identify new sites or updated sites – Gather information from them – Feed it to the indexing program
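The spider's job described on Slide 7 can be sketched in a few lines of Python. This is only an illustration, not how any real engine crawls: the `LinkCollector` class, the `crawl` function, the caller-supplied `fetch` function, and the `example.com` pages are all assumptions made for the example.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects the title text and absolute link URLs of one HTML page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self.title = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(urljoin(self.base_url, href))
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def crawl(start_url, fetch, limit=10):
    """Breadth-first visit of pages reachable from start_url.

    `fetch` is a caller-supplied function (an assumption of this sketch)
    that returns the HTML for a URL, or None if the page is unavailable.
    """
    queue, seen, gathered = [start_url], {start_url}, {}
    while queue and len(gathered) < limit:
        url = queue.pop(0)
        html = fetch(url)
        if html is None:
            continue
        parser = LinkCollector(url)
        parser.feed(html)
        gathered[url] = parser.title  # information handed to the indexer
        for link in parser.links:     # newly identified pages to visit
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return gathered


# A two-page stand-in for the web, so the sketch runs without a network.
fake_web = {
    "http://example.com/": '<title>Home</title><a href="/about">About</a>',
    "http://example.com/about": '<title>About</title><a href="/">Home</a>',
}
found = crawl("http://example.com/", fake_web.get)
print(found)
```

Passing `fetch` in as a parameter keeps the sketch independent of any particular HTTP library.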
Slide 8: Indexing program & index • In each newly identified page it will index: – Every word – URL – Meta tags – URLs of links on the page – Image file names
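The indexing step on Slide 8 can be pictured as building an inverted index, a map from each word to the pages that contain it. The sketch below is a simplification under stated assumptions: the `Page` record, the `build_index` function, and the sample URLs are hypothetical, and only the page text and meta tags are indexed.

```python
import re
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Page:
    """Hypothetical record of what a spider hands to the indexer."""
    url: str
    text: str
    meta_tags: dict = field(default_factory=dict)
    links: list = field(default_factory=list)
    image_files: list = field(default_factory=list)


def tokenize(text):
    """Lowercase the text and split it into words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Map every word to the set of URLs whose text or meta tags contain it."""
    index = defaultdict(set)
    for page in pages:
        words = tokenize(page.text)
        for value in page.meta_tags.values():
            words.extend(tokenize(value))
        for word in words:
            index[word].add(page.url)
    return index


pages = [
    Page("http://example.com/a", "Kuwait news and blogs", {"keywords": "kuwait, media"}),
    Page("http://example.com/b", "iPod reviews", links=["http://example.com/a"]),
]
index = build_index(pages)
print(sorted(index["kuwait"]))
```

A fuller version would keep separate indexes for titles, URLs, links, and image file names so that each can be searched on its own.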
Slide 9: Search engine program • Retrieves the user query • Arranges the retrieved records (relevance ranking algorithm) • Factors affecting the relevance ranking algorithm: – Page popularity (by number of pages linking to it) – Number of keyword occurrences – Proximity of search terms – Location of search terms
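The four ranking factors on Slide 9 can be combined into a toy score. This is a hedged sketch, not any engine's actual algorithm: the `score` function, the page fields, and every weight are arbitrary assumptions chosen only to make the factors visible.

```python
import re


def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())


def min_distance(positions_a, positions_b):
    """Smallest gap, in words, between any occurrence of two terms."""
    return min(abs(a - b) for a in positions_a for b in positions_b)


def score(page, terms):
    """Toy relevance score; the weights below are arbitrary assumptions."""
    words = tokenize(page["text"])
    title_words = tokenize(page["title"])
    terms = [t.lower() for t in terms]

    # Factor 1: popularity, measured by the number of pages linking here.
    total = 1.0 * page["inbound_links"]

    # Factor 2: number of keyword occurrences in the body text.
    total += 2.0 * sum(words.count(t) for t in terms)

    # Factor 3: proximity, rewarded when the first two terms appear close together.
    positions = [[i for i, w in enumerate(words) if w == t] for t in terms]
    if len(terms) >= 2 and all(positions[:2]):
        total += 5.0 / max(1, min_distance(positions[0], positions[1]))

    # Factor 4: location, with a bonus for terms that appear in the title.
    total += 3.0 * sum(1 for t in terms if t in title_words)
    return total


pages = [
    {"url": "http://example.com/a", "title": "Kuwait blogs",
     "text": "A list of Kuwait blogs and more blogs", "inbound_links": 3},
    {"url": "http://example.com/b", "title": "Travel",
     "text": "Blogs about travel, sometimes mentioning Kuwait", "inbound_links": 5},
]
ranked = sorted(pages, key=lambda p: score(p, ["kuwait", "blogs"]), reverse=True)
print([p["url"] for p in ranked])
```

With these sample pages the first one ranks higher despite having fewer inbound links, because the terms occur more often, sit next to each other, and appear in the title.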
Slide 10: HTML user interface • Home page for the search service and advanced search service • Contains a search box • Contains links to various databases (images, news, etc.)
Slide 11: Search options • Vary from one search engine to another • Some are on the home page, others on the advanced search page • Available in: – Menu approach – Prefix approach
Slide 12: Examples of search options • Phrase searching • Language specification • File type • Specifying the retrieved term location: – Title – URL – Links • Boolean operations
Slide 13: Phrase searching • Available in every search engine • Done the same way in all search engines • Search engines have a limit on the number of entered words • To search for a phrase: put the phrase in quotation marks “ ”. You will get only pages which contain that phrase
Slide 14: "powerpoint template"
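Phrase searching, as described on Slides 13 and 14, differs from an ordinary search in that the words must appear next to each other and in order. A minimal sketch of that check follows; the function names and the sample pages are assumptions made for the example.

```python
import re


def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())


def contains_phrase(text, phrase):
    """True if the words of `phrase` appear in `text` consecutively, in order."""
    words = tokenize(text)
    target = tokenize(phrase)
    if not target:
        return False
    n = len(target)
    return any(words[i:i + n] == target for i in range(len(words) - n + 1))


def phrase_search(query, pages):
    """Return URLs of pages containing every quoted phrase in `query`."""
    phrases = re.findall(r'"([^"]+)"', query)
    return [url for url, text in pages.items()
            if all(contains_phrase(text, p) for p in phrases)]


pages = {
    "http://example.com/1": "Free PowerPoint template gallery",
    "http://example.com/2": "A template for PowerPoint slides",
}
hits = phrase_search('"powerpoint template"', pages)
print(hits)
```

The second page contains both words but not as an adjacent pair, so only the first page is returned.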
Slide 15: Title searching • Most powerful technique for getting highly relevant outcomes • Available in every search engine • To search for a term in a title: put the prefix intitle: followed by the term without any spaces
Slide 16: intitle:ipod
Slide 17: ipod • Anywhere on the page • In page title • In URL
Slide 18: Site searching • Performing a search in a specific site • Even if the site has searching capabilities, it is better to search in large engines • To search for a word in a site: put the term, then enter a space, then the prefix site: followed by the address
Slide 19: Hillary Clinton site:cnn.com
Slide 20: Site searching • To search for a word in a portion of a site: put the term, then enter a space, then the prefix site: followed by the specific address • e.g.: Hillary Clinton site:news.bbc.co.uk/
Slide 21: URL searching • If you want to search for a term in a specific URL: put the term, then enter a space, then the prefix inurl: followed by the site name • e.g.: kuwait inurl:cnn.com
Slide 22: kuwait inurl:cnn.com
Slide 23: Domain searching • It is identical to URL searching in many search engines • But it is useful to limit your retrieval to sites in a particular top-level domain • To search for a term in a particular top-level domain: put the term, then enter a space, then the prefix inurl: or site: followed by the top-level domain
Slide 24: Taghreed Alqudsi inurl:.kw
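The prefix approach used on Slides 15 to 24 (intitle:, site:, inurl:) amounts to splitting a query into plain terms and field restrictions. The sketch below shows one way that could work. It is an illustration only: `parse_query`, `matches`, and the page records are hypothetical, plain terms are matched as simple substrings, and `site:` is compared against the host name only, so an address with a path is not handled.

```python
from urllib.parse import urlparse

PREFIXES = ("intitle:", "site:", "inurl:")


def parse_query(query):
    """Split a query into plain terms and (prefix, value) restrictions."""
    terms, filters = [], []
    for token in query.split():
        for prefix in PREFIXES:
            if token.lower().startswith(prefix) and len(token) > len(prefix):
                filters.append((prefix[:-1], token[len(prefix):].lower()))
                break
        else:
            # No prefix matched, so this is an ordinary search term.
            terms.append(token.lower())
    return terms, filters


def matches(page, terms, filters):
    """Check one page record against the parsed query."""
    body = page["text"].lower()
    if not all(term in body for term in terms):
        return False
    for kind, value in filters:
        if kind == "intitle" and value not in page["title"].lower():
            return False
        if kind == "inurl" and value not in page["url"].lower():
            return False
        if kind == "site":
            host = urlparse(page["url"]).netloc.lower()
            if not (host == value or host.endswith("." + value.lstrip("."))):
                return False
    return True


pages = [
    {"url": "http://www.cnn.com/world/kuwait.html", "title": "Kuwait report",
     "text": "Hillary Clinton visits Kuwait"},
    {"url": "http://example.org/ipod", "title": "iPod guide",
     "text": "All about the ipod and Clinton"},
]
terms, filters = parse_query("Hillary Clinton site:cnn.com")
hits = [p["url"] for p in pages if matches(p, terms, filters)]
print(hits)
```

The same two functions cover the other examples from the slides, such as `intitle:ipod` and `kuwait inurl:cnn.com`.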
Slide 25: Link searching • Check which web pages have linked to your organization’s URL • Useful in identifying who is interested in your organization • Available in some search engines
Slide 26: http://www.kuniv.edu/
Slide 27: Language searching • Allows the user to limit retrieval to pages written in a given language • Search engines differ in the languages provided
Slide 29: Search by date • Provided by most major engines • Impossible to determine “date created” or “date of publication” • Determines the page's last modification date or when the search engine last crawled it
Slide 31: Searching by file type • Limits user retrieval to a specific file type • Includes PDF, Word documents, Excel, PowerPoint, etc.
Slide 33: Boolean search options • The process of identifying web pages that contain a particular combination of search terms • AND = all terms must be present • OR = any of the terms is accepted • NOT (“-”) = if the term is present, the item is rejected • Combinations can be used along with parentheses () to indicate the order
Slide 34: Boolean search options • All major search engines automatically AND your query terms • Available in: – Syntax approach – Menu approach • If you want to search for a group of terms, e.g.: Blog AND Kuwait -kids
Slide 35: Kuwaiti blogs Kuwaiti blogs Kuwaiti blogs kids
Slide 36: Full Boolean • A search engine is considered to have full Boolean capabilities if it provides all Boolean operations (AND, OR, NOT) • Engines vary in the syntax used for Boolean expressions
Slide 37: Boolean syntax in major engines (engine: Boolean pattern, full Boolean) • Ask.com: A B -C, NO • Google: A B OR C -D, YES • Windows Live: A (B OR C) -D, YES • Yahoo!: A B -D, YES
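The Boolean operations on Slides 33 to 37 map directly onto set operations over an index: AND is intersection, OR is union, and NOT is set difference. The sketch below assumes a tiny hand-made index; the page identifiers and function names are made up for the example.

```python
def search_and(index, *terms):
    """AND: pages that contain every term."""
    sets = [index.get(t, set()) for t in terms]
    return set.intersection(*sets) if sets else set()


def search_or(index, *terms):
    """OR: pages that contain at least one term."""
    return set().union(*(index.get(t, set()) for t in terms))


def search_not(results, index, term):
    """NOT ("-"): drop pages that contain the term."""
    return results - index.get(term, set())


# Hypothetical index: word -> set of page identifiers.
index = {
    "blog": {"p1", "p2", "p3"},
    "kuwait": {"p1", "p2"},
    "kids": {"p2"},
    "news": {"p4"},
}

# Blog AND Kuwait -kids
result = search_not(search_and(index, "blog", "kuwait"), index, "kids")
print(result)  # {'p1'}

# A (B OR C) -D, the fully parenthesized pattern
grouped = search_not(
    search_and(index, "blog") & search_or(index, "kuwait", "news"), index, "kids"
)
print(grouped)  # {'p1'}
```

Nesting the function calls plays the role that parentheses play in the query syntax: the innermost operation is evaluated first.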
Slide 38: Search engine overlap • No single search engine covers everything • Due to differences in: – Crawling – Indexing – Web pages included in databases
Slide 39: Results pages • Take a few extra seconds to look at the details in each record • May search other databases (images, video, news, etc.) • “Translate this page” • Spell checker • Sponsored results • Clustered results
Slide 40: other DB
Slide 41: Spell checker
Slide 42: Sponsored results
Slide 43: Clustered results
Slide 44: Search engine accounts • Some features are provided only for users who have an account with the engine • Spamophobic? Don’t be afraid
Slide 46: Specialty search engines • Some are geographic (focusing on sites from one country) • Some are topical (focusing on a particular subject area) • Dmoz.org open directory
Slide 48: Meta search engines • Services that let the user search several search engines at the same time • Most powerful are: – Dogpile.com – Ixquick.com – Vivisimo.com – MetaCrawler.com – Search.com
Slide 49: Meta search engines drawbacks • May not cover all large engines • Most only return the first 10 or 20 records • Most syntax does not work • Some present paid listings first
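A meta search engine, as outlined on Slides 48 and 49, sends one query to several engines and merges what comes back. The sketch below is a hedged illustration: the `meta_search` function and the two stand-in engines are assumptions, not real APIs, and the reciprocal-rank scoring is one simple merging choice among many.

```python
def meta_search(query, engines, per_engine=10):
    """Send one query to several engines and merge the results.

    `engines` maps a name to a caller-supplied function that returns a
    ranked list of URLs; these functions are placeholders, not real APIs.
    """
    merged = {}
    for name, search in engines.items():
        # Only the first `per_engine` records are taken from each engine.
        for rank, url in enumerate(search(query)[:per_engine], start=1):
            entry = merged.setdefault(url, {"score": 0.0, "sources": []})
            entry["score"] += 1.0 / rank  # higher-ranked hits count more
            entry["sources"].append(name)
    return sorted(merged.items(), key=lambda item: item[1]["score"], reverse=True)


# Two stand-in engines that return fixed result lists.
engines = {
    "engine_a": lambda q: ["u1", "u2", "u3"],
    "engine_b": lambda q: ["u2", "u4"],
}
results = meta_search("kuwait blogs", engines)
print([url for url, _ in results])
```

The `per_engine` cut-off mirrors the drawback noted on Slide 49: records beyond the first 10 or 20 from each engine never reach the merged list.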
Slide 50: Search engine shortcuts • Just enter a brief statement in the main search box and the answer will appear – Calculator – Spell check – Definition – Hotel finder • Check http://www.extremesearcher.com/shortcuts
Slide 51: 15*(14+43)
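The calculator shortcut on Slide 51 can be imitated with a small arithmetic evaluator. This is a sketch of the idea, not how any search engine implements it; the `calculate` function is hypothetical and supports only numbers, the four basic operators, unary minus, and parentheses.

```python
import ast
import operator

OPERATORS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def calculate(expression):
    """Evaluate +, -, *, / and parentheses without calling eval()."""
    def walk(node):
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in OPERATORS:
            return OPERATORS[type(node.op)](walk(node.left), walk(node.right))
        if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
            return -walk(node.operand)
        raise ValueError("unsupported expression")

    return walk(ast.parse(expression, mode="eval").body)


print(calculate("15*(14+43)"))  # 855
```

Walking the parsed syntax tree rather than calling `eval()` means anything other than plain arithmetic is rejected with a `ValueError`.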
Slide 52: Mashups • Website that combines content from more than one source into an integrated experience • E.g.: a broad range of geographic location codes integrated with maps and aerial images, producing an exciting way to find and visualize data • Maps can display property for sale or rent
Slide 53: Mashups • Send a GeoGreeting http://www.geogreeting.com/main.html • Who is Sick? • U.S. Fast Food Map • Starbucks Coffee Finder • Ask 500 People • Wikimapia
Slide 54: Desktop search programs • Provided by all major search engines • Download for free • Index the contents of the PC • Used to search all files • The best is Google • Differ in: – Which file types are indexed – How much control you have over what is indexed – What searching options are provided
Slide 55: Keeping up-to-date • Searchenginewatch.com • Provides up-to-date news and reports in a clear and readable style • Valuable for both search engine users and web site developers • Access to much of the site content is free • Free weekly newsletter
Slide 56: For listening



