SEARCH ENGINES SEARCH ENGINES Search engines Web Search Page-Specific Search Default search Search optionspicture dictionary ppt compiled by Asha Krishnan Department of Management Studies Pondicherry University fhgrficerocket People Search wwikiasearch krugle
The primary objective - to understand what is a search engine Awareness about classification of search engines To learn about 6 different types of search engines Objectives
. SEARCH ENGINES A web search engine is a tool designed to search for information on the World Wide Web. The search results are usually presented in a list and are commonly called hits. The information may consist of web pages, images, information and other types of files. Some search engines also mine data available in databases or open directories.
More about search engines Web search engines work by about many web pages, which they retrieve from the WWW itself. These pages are retrieved by a Web crawler (sometimes also known as a spider) an automated Web browser which follows every link it sees. As a query is entered into a search engine using key words the engine examines its index and provides a listing of best-matching web pages according to its criteria The contents of each page are then analyzed to determine how it should be indexed for e.g., words extracted from the titles, headings, or special fields called meta tags. Data about web pages are stored in an index database for use in later queries
Open source search engines, Semantic browsing engines, Social search Engines, Metasearch engines, Visual search engines
Google, Yahoo!, Bing The first ever search engine was Archie written in 1990 by Alan Emtag, Bill Heelan and J. Peter Deutsch
Types of search engines Veronica (Very Easy Rodent-Oriented Net-wide Index to Computerized Archives) 1991 had its name inspired from Archie the first search engine
COLLABORATIVE SEARCH ENGINES Collaborative Search Engines(CSEs) are an emerging trend for Web search and search within company intranets. CSEs let users concert their efforts in information retrieval (IR) activities and allow experts to guide less experienced people through their searches. Search terms and links clicked that are shared among users reveal their interests, habits, social relations and intentions. CSEs put the privacy of the users at risk.
META SEARCH ENGINES Metasearch engines create what is known as a virtual database. They do not compile a physical database or catalogue of the web No two Metasearch engines are alike. Users requests taken Passed to heterogeneous databases Compile the results in homogenous manner based on algorithm A Metasearch engine does not have any direct access to the search engines’ database.
APPLICATIONS OF META SEARCH Image meta search- search of images based on associated metadata such as keywords, text, etc. "Reverse" image search- search for images using sophisticated abstractions of chromatic and other related information found within the image.
A video search engine is a web-based search engine which crawls the web for videocontent. Some video search engines parse externally hosted content while others allow content to be uploaded and hosted on their own servers. Some engines also allow users to search by video format type and by length of the clip. Search results are usually accompanied by a thumbnail view of the video. VIDEO SEARCH ENGINE YouTube receives a billion hits per day!
Agnostic search Search that is not affected by the hosting of video, where results are agnostic no matter where the video is located: E.g. AltaVista Video Search,Dabble.com, Picsearch Video Search Non-agnostic Search Search results are modified, or suspect, due to the large hosted video being given preferential treatment in search results E.g. YouTube, Yahoo video search TYPES OF VIDEO SEARCH
Enterprise search is the practice of identifying and enabling specific content across the enterprise to be indexed, searched, and displayed to authorized users. The term "enterprise search" describes the application of search technology to information within an organization ENTERPRISE SEARCH
Major challenge faced by Enterprise search the need to index data and documents from a variety of sources (file systems , intranets, document management systems, e-mail) and then present a consolidated list of relevance ranked resources from these various source . Vendors of proprietary enterprise search software AskMeNow, Attivio, Google Search Appliance, IBM
SEMANTIC SEARCH Semantic search is a process used to improve online searching by using data from semantic networksto disambiguate queries and web text in order to generate more relevant results It provides an overview that lists semantic search systems and identifies other uses of semantics in the search process Semantic Search uses semantics or the science of meaning in language to produce highly relevant search results. The Semantic Web is an evolving development of the World Wide Web where the meaning semantics of information and services on the web is defined, making it possible for the web to understand and satisfy the requests of people.
OPEN SEARCH OpenSearchis a collection of technologies that allow publishing of search results in a format suitable for syndication and aggregation It is a way for websites and search engines to publish search results in a standard and accessible format. OpenSearch was developed by Amazon.com subsidiary A9 and the first version, OpenSearch 1.0, was unveiled by Jeff Bezos at the Web 2.0 conference in March, 2005
Open search consists of Search engines and software that support OpenSearch OpenSearch Description files OpenSearch Query Syntax: Open Search RSS (in OpenSearch 1.0) or OpenSearch Response (in Open Search 1.1) OpenSearch Aggregators OpenSearch "Auto-discovery” Wikipedia suggests articles matching a typed-in entry with incremental find. Internet Explorer 7 and above.
SOCIAL SEARCH Social search or a social search engine is a type of web search method that determines the relevance of search results by considering the interactions or contributions of users. Its user-based approach to relevance is in contrast to established algorithmic or machine-based approaches where relevance is determined by analyzing the text of each document or the link structure of the documents
Takes into account varying sources of metadata, such as collaborative discovery of web pages, tags, social ranking. many forms - ranging from simple shared bookmarks or tagging of content with descriptive labels to more sophisticated approaches that combine human intelligence with computer algorithms. .
Example forms of user input include social bookmarking or direct interaction with the search results such as promoting or demoting results the user feels are more or less relevant to their query Companies in the social search space include Sproose, Parse.ly, Mahalo, Jumper 2.0, Wikia Search In 2008 a few start up companies began the technique of focussing on ranking search results according to one's social graph on social network.
SELECTION BASED SEARCH A selection-based search system is a search engine system in which the user invokes a search query using only the mouse. It allows the user to search the internet for more information about any keywordor phrase contained within a document or webpage in any software application on his desktop computer using the mouse. These search systems create what is known as a semantic database of trained terms. They do not compile a physical database or catalogue of the web on the users' desktop computer.
3 characteristic features invoke search using the mouse from within the context of any application on his desktop. e.g. Microsoft Office, Adobe Reader, Mozilla Firefox receive categorized suggestions based on the context of the user-selected text. view the results in floating information boxes that can be sized, shared, docked, closed and stacked on top of the document that has the user’s primary focus the user can
The other search engine types are Desktop search engines e.g. Ask.com, windows search Usenet e.g. Google groups Search appliances Acquired or defunct search engines e.g. Google answers, Wikia search