SlideShare a Scribd company logo
1 of 34
Ammara Muhammad Ashfaq
INFORMATION RETRIEVAL
TECHNIQUES
Owned by Terra/Lycos.
One of the largest web search engines.
Uses the Inktomi database combined with Direct
Basic search screen is simple, but the advanced
search allows for a full range of search features.
INTRODUCTION
HotBot Launched in May 1996
Founded by Eric Brewer and assistant professor at the University of
California at Berkeley and Paul Gauthier
it was originally owned and operated by Wired Magazine.
It was a very popular search engine in the 1990s, with it’s wild
colors and great results.
The search results were provided by the Inktomi database and
directory results provided by LookSmart and The Open Directory
HISTORY
HISTORY
In 1998 the search engine was acquired by the Lycos Company and languished
with limited development and falling market share.
It was re-launched in 2002 as a meta-like search tool that gave users the option
to search either the Google, Inktomi, Teoma or FAST databases.
HotBot continues to attract a small amount of search traffic and provides results
from either the Ask Jeeves (Teoma) or Google database.
WORKING OF HOTBOT
Hotbot search engine algorithm is based on:
• keywords contained in the title
• keywords meta tags
• keywords prominence and density in a text content
Document length (maximum 800 words for
hotbot) of any of your web pages.
HOTBOT RANKING ALGORITHM
France: http://www.hotbot.lycos.fr
Germany: http://www.hotbot.lycos.de
Italy: http://www.hotbot.lycos.it
Netherlands: http://www.hotbot.lycos.nl
Spain: http://www.hotbot.lycos.es
United Kingdom: http://www.hotbot.lycos.co.uk
HOTBOT AROUND THE WORLD
A crawler is a program that visits Web sites and
reads their pages and other information in
order to create entries for a search engine index
A program that automatically fetches Web
pages. Spiders are used to feed pages to search
engines. It's called a spider because it
crawls over the Web
SPIDER AND WEB CRAWLER
HotBot offers the choice of three search engine
databases:
HotBot (which is actually a Yahoo!/Inktomi database)
Google
Ask Jeeves (the Teoma database)
DATABASES
• Advanced searching capabilities
• Page depth limit
• Advanced search help
• Truncation
• Quick check of three major databases
STRENGTHS
Link searches must be exact
Database size shrunk for awhile
Advanced features have not always worked
right
Does not include all advanced features of
each of the four databases
WEAKNESSES
No cached copies of pages
Only displays a few hits from each domain with no
access to the rest in Inktomi
Same ads at the top push regular results below the fold
Should have a file type limit for PDF, MS Word,
PowerPoint, and Excel files
WEAKNESSES
Default Operation: Processed as an AND
Full Boolean Searching: AND, OR, and NOT
Proximity Searching
Truncation with the * symbol
Case sensitive
Extensive, dynamic stop word list
Word Stemming - Search for grammatical word variants
including plural, singular, and tense.
SEARCH FEATURES
Multiple search terms are processed as an AND operation by
default.
DEFAULT OPERATION
HotBot offers full Boolean searching.
Use the operators AND, OR, and NOT.
Operators must be in upper case. HotBot can also use
for NOT.
Under Word Filters, it has a drop down menu choice for
All the Words, Any of the Words, Not the Words, Exact
Phrase, and Not Exact Phrase.
These can be used to add additional terms or combining
a phrase search with a Boolean search.
BOOLEAN SEARCHING
HotBot and the other Inktomi databases were
sometimes case sensitive for unusual usages of case. If
search terms are entered in all lower case, all upper case,
or with an initial capital, all mixtures of upper and lower
case are searched.
If a search term contains one or more UPPER case
letters in the middle of a word such as arXiv, the search
is limited to only records that exactly match the specified
case.
CASE SENSITIVITY
ANY words with charters after the stem will be
matched to your query term if the search engine
supports truncation.
Thus if we stem bird*, our search will match on the
words birdbrain.
Posing bird* to Hotbot we now get this document
Bird
1,834,510
WORD STEMMING OR
TRUNCATION
NO. Just phrase searching.
PROXIMITY SEARCHING
The display includes the relevance score, title, URL, a brief
extract, and date.
HotBot displays 10 records at a time, by default.
However, users can request displays of 10, 25, 50, 75, or 100
records at a time.
More search engines should give such options. To always go
directly to Advanced Search with the default of 100 records and
the 'Boolean phrase' option, make a bookmark to these Advanced
Search settings, or use their personalization feature.
DISPLAY
Searching title words and links to a specific
URL
acrobat/applet/activex/audio/embed/
flash/form/frame/image/script/
shockwave/table/video/vrml
FIELD SEARCHES
Results are sorted by relevance with
groupings by site available at the end of
each brief record.
The display includes the relevance score,
title, URL, a brief extract, and date. HotBot
displays 10 records at a time, by default.
SORTING
HotBot and the other Inktomi databases have an
extensive, dynamic stop word list.
Many common words and numbers will not be
searched.
The list changes as the frequency of terms in the
database change.
When a stop word is in a phrase, it may not be
obvious that the whole phrase is not being
searched.
STOP WORDS
WILDCARD SEARCHES
Wildcards searching generally places the symbol "*"
after a word. It tells the database to look for
variations of that word. For Example:
Investigation*
Might pull sites with words such as investigation,
investigator, and investigative.
Some search engines allow you to create more
complex queries by grouping AND, OR, NOT,
and NEAR statements using parentheses.
Investigator NEAR (Texas OR Tx)
In the above example, you should pull
investigators in Texas or TX whether the state
name is spelled out in full or abbreviated.
NESTED SEARCHING
Page Type –
Default is Any (Any pages)
Top Page (the root page of a URL ie.
www.unca.edu)
Page Depth - Limits how far down a
subdirectory hierarchy Hotbot Searches
These are useful for finding the primary sites for
organizations or information
UNIQUE FOR HOTBOT
Smaller databases
Less pointing to external pages
Paid advertising or sponsorship for
visibility
Rise of search only sites
FUTURE POSSIBILITIES
HotBot is an interface to advanced web searches, and it
presents a dynamically changing backend.
Both the Inktomi and Direct Hit technologies serve, in
different ways, to provide a relevant list of results
through advanced queries, and both seek to minimize
the commercial influence over search results.
All of these technologies are subject to changes in
technology developments, and changes in the business
environment.
CONCLUSION
Its weaknesses include that it still doesn't
seem to produce the depth and breadth of
some other engines, and that it's advanced
features have not always worked correctly.
As the proliferation of this engine's index
and searching features continues, these
weaknesses should be overcome.
CONCLUSION

More Related Content

What's hot

Newspaper price and page act,1956
Newspaper price and page act,1956Newspaper price and page act,1956
Newspaper price and page act,1956
Anirban Mandal
 
American political system ppt
American political system pptAmerican political system ppt
American political system ppt
esheevers
 
History of journalism
History of journalismHistory of journalism
History of journalism
Jackie Scott
 
History of Printing
History of PrintingHistory of Printing
History of Printing
Mandi Lopez
 
History of press laws in india
History of press laws in indiaHistory of press laws in india
History of press laws in india
forthpillers
 
United states v lopez
United states v lopezUnited states v lopez
United states v lopez
shshipley
 

What's hot (20)

History of magazines
History of magazinesHistory of magazines
History of magazines
 
Prb act,1867
Prb act,1867Prb act,1867
Prb act,1867
 
PRB Act
PRB ActPRB Act
PRB Act
 
Press council ppt
Press council pptPress council ppt
Press council ppt
 
Newspaper price and page act,1956
Newspaper price and page act,1956Newspaper price and page act,1956
Newspaper price and page act,1956
 
Press Law
Press LawPress Law
Press Law
 
American political system ppt
American political system pptAmerican political system ppt
American political system ppt
 
Constitution, Mission and Code of Practice of Pakistan Federal Union of Journ...
Constitution, Mission and Code of Practice of Pakistan Federal Union of Journ...Constitution, Mission and Code of Practice of Pakistan Federal Union of Journ...
Constitution, Mission and Code of Practice of Pakistan Federal Union of Journ...
 
Kannada Journalism
Kannada JournalismKannada Journalism
Kannada Journalism
 
Investigative Journalism
Investigative JournalismInvestigative Journalism
Investigative Journalism
 
News Literacy, Week 4 Lecture
News Literacy, Week 4 LectureNews Literacy, Week 4 Lecture
News Literacy, Week 4 Lecture
 
History of journalism
History of journalismHistory of journalism
History of journalism
 
History of Printing
History of PrintingHistory of Printing
History of Printing
 
Press commissin.
Press commissin.Press commissin.
Press commissin.
 
The History of Printing
The History of PrintingThe History of Printing
The History of Printing
 
Production process of a newspaper
Production process of a newspaperProduction process of a newspaper
Production process of a newspaper
 
Normative/functionalist theories of press
Normative/functionalist theories of pressNormative/functionalist theories of press
Normative/functionalist theories of press
 
History of press laws in india
History of press laws in indiaHistory of press laws in india
History of press laws in india
 
AP news agency
AP news  agencyAP news  agency
AP news agency
 
United states v lopez
United states v lopezUnited states v lopez
United states v lopez
 

Similar to Hotbot ppt

Academic Skills 4
Academic Skills 4Academic Skills 4
Academic Skills 4
Hala Nur
 
google search engine
google search enginegoogle search engine
google search engine
way2go
 
Searching the Internet
Searching the Internet Searching the Internet
Searching the Internet
guest32ae6
 
The Internet
The InternetThe Internet
The Internet
mscuttle
 
Searching techniques
Searching techniquesSearching techniques
Searching techniques
PCTE
 
How to become an effective web searcher
How to become an effective web searcherHow to become an effective web searcher
How to become an effective web searcher
rangak
 
Database poll results
Database poll resultsDatabase poll results
Database poll results
Stephen Abram
 

Similar to Hotbot ppt (20)

Internet Research Presentation
Internet Research PresentationInternet Research Presentation
Internet Research Presentation
 
Search Enginesv2
Search Enginesv2Search Enginesv2
Search Enginesv2
 
Academic Skills 4
Academic Skills 4Academic Skills 4
Academic Skills 4
 
Computer study lesson - Internet Search (25 Mar 2020)
Computer study lesson - Internet Search (25 Mar 2020)Computer study lesson - Internet Search (25 Mar 2020)
Computer study lesson - Internet Search (25 Mar 2020)
 
Searching the Web
Searching the WebSearching the Web
Searching the Web
 
google search engine
google search enginegoogle search engine
google search engine
 
Searching the Internet
Searching the Internet Searching the Internet
Searching the Internet
 
The Internet
The InternetThe Internet
The Internet
 
Internet Search
Internet SearchInternet Search
Internet Search
 
Searching techniques
Searching techniquesSearching techniques
Searching techniques
 
Searching techniques
Searching techniquesSearching techniques
Searching techniques
 
Searching techniques
Searching techniquesSearching techniques
Searching techniques
 
How to become an effective web searcher
How to become an effective web searcherHow to become an effective web searcher
How to become an effective web searcher
 
Search engines
Search enginesSearch engines
Search engines
 
Introduction to internet.
Introduction to internet.Introduction to internet.
Introduction to internet.
 
Internet search techniques by zakir hossain
Internet search techniques by zakir hossainInternet search techniques by zakir hossain
Internet search techniques by zakir hossain
 
Internet search techniques for K12
Internet search techniques for K12Internet search techniques for K12
Internet search techniques for K12
 
Database poll results
Database poll resultsDatabase poll results
Database poll results
 
Search Engines
Search EnginesSearch Engines
Search Engines
 
Internet Searching Version2
Internet Searching Version2Internet Searching Version2
Internet Searching Version2
 

Recently uploaded

Recently uploaded (20)

Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
VAMOS CUIDAR DO NOSSO PLANETA! .
VAMOS CUIDAR DO NOSSO PLANETA!                    .VAMOS CUIDAR DO NOSSO PLANETA!                    .
VAMOS CUIDAR DO NOSSO PLANETA! .
 
21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx21st_Century_Skills_Framework_Final_Presentation_2.pptx
21st_Century_Skills_Framework_Final_Presentation_2.pptx
 
Simple, Complex, and Compound Sentences Exercises.pdf
Simple, Complex, and Compound Sentences Exercises.pdfSimple, Complex, and Compound Sentences Exercises.pdf
Simple, Complex, and Compound Sentences Exercises.pdf
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
Model Attribute _rec_name in the Odoo 17
Model Attribute _rec_name in the Odoo 17Model Attribute _rec_name in the Odoo 17
Model Attribute _rec_name in the Odoo 17
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17How to Manage Call for Tendor in Odoo 17
How to Manage Call for Tendor in Odoo 17
 
dusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learningdusjagr & nano talk on open tools for agriculture research and learning
dusjagr & nano talk on open tools for agriculture research and learning
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
AIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.pptAIM of Education-Teachers Training-2024.ppt
AIM of Education-Teachers Training-2024.ppt
 
Economic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food AdditivesEconomic Importance Of Fungi In Food Additives
Economic Importance Of Fungi In Food Additives
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...OS-operating systems- ch05 (CPU Scheduling) ...
OS-operating systems- ch05 (CPU Scheduling) ...
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 

Hotbot ppt

  • 1. Ammara Muhammad Ashfaq INFORMATION RETRIEVAL TECHNIQUES
  • 2.
  • 3. Owned by Terra/Lycos. One of the largest web search engines. Uses the Inktomi database combined with Direct Basic search screen is simple, but the advanced search allows for a full range of search features. INTRODUCTION
  • 4. HotBot Launched in May 1996 Founded by Eric Brewer and assistant professor at the University of California at Berkeley and Paul Gauthier it was originally owned and operated by Wired Magazine. It was a very popular search engine in the 1990s, with it’s wild colors and great results. The search results were provided by the Inktomi database and directory results provided by LookSmart and The Open Directory HISTORY
  • 5. HISTORY In 1998 the search engine was acquired by the Lycos Company and languished with limited development and falling market share. It was re-launched in 2002 as a meta-like search tool that gave users the option to search either the Google, Inktomi, Teoma or FAST databases. HotBot continues to attract a small amount of search traffic and provides results from either the Ask Jeeves (Teoma) or Google database.
  • 6.
  • 7.
  • 9.
  • 10. Hotbot search engine algorithm is based on: • keywords contained in the title • keywords meta tags • keywords prominence and density in a text content Document length (maximum 800 words for hotbot) of any of your web pages. HOTBOT RANKING ALGORITHM
  • 11. France: http://www.hotbot.lycos.fr Germany: http://www.hotbot.lycos.de Italy: http://www.hotbot.lycos.it Netherlands: http://www.hotbot.lycos.nl Spain: http://www.hotbot.lycos.es United Kingdom: http://www.hotbot.lycos.co.uk HOTBOT AROUND THE WORLD
  • 12. A crawler is a program that visits Web sites and reads their pages and other information in order to create entries for a search engine index A program that automatically fetches Web pages. Spiders are used to feed pages to search engines. It's called a spider because it crawls over the Web SPIDER AND WEB CRAWLER
  • 13.
  • 14. HotBot offers the choice of three search engine databases: HotBot (which is actually a Yahoo!/Inktomi database) Google Ask Jeeves (the Teoma database) DATABASES
  • 15. • Advanced searching capabilities • Page depth limit • Advanced search help • Truncation • Quick check of three major databases STRENGTHS
  • 16. Link searches must be exact Database size shrunk for awhile Advanced features have not always worked right Does not include all advanced features of each of the four databases WEAKNESSES
  • 17. No cached copies of pages Only displays a few hits from each domain with no access to the rest in Inktomi Same ads at the top push regular results below the fold Should have a file type limit for PDF, MS Word, PowerPoint, and Excel files WEAKNESSES
  • 18.
  • 19. Default Operation: Processed as an AND Full Boolean Searching: AND, OR, and NOT Proximity Searching Truncation with the * symbol Case sensitive Extensive, dynamic stop word list Word Stemming - Search for grammatical word variants including plural, singular, and tense. SEARCH FEATURES
  • 20. Multiple search terms are processed as an AND operation by default. DEFAULT OPERATION
  • 21. HotBot offers full Boolean searching. Use the operators AND, OR, and NOT. Operators must be in upper case. HotBot can also use for NOT. Under Word Filters, it has a drop down menu choice for All the Words, Any of the Words, Not the Words, Exact Phrase, and Not Exact Phrase. These can be used to add additional terms or combining a phrase search with a Boolean search. BOOLEAN SEARCHING
  • 22. HotBot and the other Inktomi databases were sometimes case sensitive for unusual usages of case. If search terms are entered in all lower case, all upper case, or with an initial capital, all mixtures of upper and lower case are searched. If a search term contains one or more UPPER case letters in the middle of a word such as arXiv, the search is limited to only records that exactly match the specified case. CASE SENSITIVITY
  • 23. ANY words with charters after the stem will be matched to your query term if the search engine supports truncation. Thus if we stem bird*, our search will match on the words birdbrain. Posing bird* to Hotbot we now get this document Bird 1,834,510 WORD STEMMING OR TRUNCATION
  • 24. NO. Just phrase searching. PROXIMITY SEARCHING
  • 25. The display includes the relevance score, title, URL, a brief extract, and date. HotBot displays 10 records at a time, by default. However, users can request displays of 10, 25, 50, 75, or 100 records at a time. More search engines should give such options. To always go directly to Advanced Search with the default of 100 records and the 'Boolean phrase' option, make a bookmark to these Advanced Search settings, or use their personalization feature. DISPLAY
  • 26. Searching title words and links to a specific URL acrobat/applet/activex/audio/embed/ flash/form/frame/image/script/ shockwave/table/video/vrml FIELD SEARCHES
  • 27. Results are sorted by relevance with groupings by site available at the end of each brief record. The display includes the relevance score, title, URL, a brief extract, and date. HotBot displays 10 records at a time, by default. SORTING
  • 28. HotBot and the other Inktomi databases have an extensive, dynamic stop word list. Many common words and numbers will not be searched. The list changes as the frequency of terms in the database change. When a stop word is in a phrase, it may not be obvious that the whole phrase is not being searched. STOP WORDS
  • 29. WILDCARD SEARCHES Wildcards searching generally places the symbol "*" after a word. It tells the database to look for variations of that word. For Example: Investigation* Might pull sites with words such as investigation, investigator, and investigative.
  • 30. Some search engines allow you to create more complex queries by grouping AND, OR, NOT, and NEAR statements using parentheses. Investigator NEAR (Texas OR Tx) In the above example, you should pull investigators in Texas or TX whether the state name is spelled out in full or abbreviated. NESTED SEARCHING
  • 31. Page Type – Default is Any (Any pages) Top Page (the root page of a URL ie. www.unca.edu) Page Depth - Limits how far down a subdirectory hierarchy Hotbot Searches These are useful for finding the primary sites for organizations or information UNIQUE FOR HOTBOT
  • 32. Smaller databases Less pointing to external pages Paid advertising or sponsorship for visibility Rise of search only sites FUTURE POSSIBILITIES
  • 33. HotBot is an interface to advanced web searches, and it presents a dynamically changing backend. Both the Inktomi and Direct Hit technologies serve, in different ways, to provide a relevant list of results through advanced queries, and both seek to minimize the commercial influence over search results. All of these technologies are subject to changes in technology developments, and changes in the business environment. CONCLUSION
  • 34. Its weaknesses include that it still doesn't seem to produce the depth and breadth of some other engines, and that it's advanced features have not always worked correctly. As the proliferation of this engine's index and searching features continues, these weaknesses should be overcome. CONCLUSION