2. Agenda
• What is Search Engine?
• Type of Search Engine
• Interesting Facts about Google
• Historical Facts about Google and Its Data Centers.
• How Crawler Based SE Works?
• Ranking Factors
• Difference between Organic and Inorganic
• Using Search Operators
3. What is Search Engine?
Definition 1 :
Search Engines are programs that search documents for specified keywords and
returns a list of the documents where the keywords were found. A Search
Engine is really a general class of programs, however, the term is often used to
specifically describe systems like Google, Bing and Yahoo! Search that enable
users to search for documents on the World Wide Web.
Definition 2:
A program that searches for and identifies items in a database that correspond
to keywords or characters specified by the user, used especially for finding
particular sites on the Internet.
4. Types of Search Engine?
Types of SE Example 1 Example 2
Crawler Based Google Yahoo
Directories Yahoo Directory Open Directory
Hybrid Search Engines Yahoo Google
Specialty Search Engines Flipkart Naukri
5. Interesting Facts about
Google
• 20 Petabytes Data Google process daily.
• Almost every person uses internet uses Google at least once
• If at least 20% of people uses the feature it will be included .
• 620 000 000 Daily visitors to Google
• Founder was not aware about HTML, that’s why they kept home page simple.
• Each Year Google receives 1,000,000 resumes > 0.5% gets selected.
• Back in 1999 Excite was a major player in industry.
• Larry and Sergey was ready to sell Google to Excite below $1 million.
• Google used to take 29 interviews before final offer. Now it is up to 9.
• Acquired 127 companies in 12 Years and have 83 International offices.
6. Historical Facts about Google
Year Steps toward success …
1995 Larry Page and Sergey Brin met at Stanford.
1996 Larry and Sergey begin collaborating on a search engine called BackRub.
1997 Google.com is registered as a domain on September 15.
1998 On September 4, Google files for incorporation in California.
1999 They move to Mountain View location: 2400 Bayshore.
2000 Google AdWords launches with 350 customers.
2001 Opened first international office, in Tokyo, Japan.
2002 Launched Google Shopping site, Name was Froogle.
2003 Announced Google AdSense.
2004 Launched Orkut / Gmail and acquired Picasa.
2005 Launched Google Maps / Google Analytics / GTalk
7. Historical Facts about Google
Year Steps toward success …
2006 Launched Calendar / Wallet / Trends.
2007 "Fortune" announces list of Best Companies to Work For and Google is
#1
2008 T-Mobile announces the G1, the first phone built on the Android OS.
2009 Rolled out Mac and Linux versions of Google Chrome.
2010 Developed technology for cars that can drive themselves.
2011 Launched Google+ Pages to connect you with the businesses.
2012 Google Drive launched. Create, share, collaborate and keep your files.
2013 Announced Calico, a new company that will focus on health.
2014 What
2015 Comes
2016 Next ???
9. How Crawler Based SE
Works?
Crawling
Crawling pages is done by
search engine automated
robots, commonly referred to
as “spiders”.
The spiders “read” one page
and then follow any links
from that page to another
page. Through links the
spiders can reach billions of
interconnected documents.
10. How Crawler Based SE
Works?
Indexing
Indexing is the process by which search engines select pieces of relevant code
(including keywords and surrounding text) from the web page and catalog them. They
store that code and related information in data centers from around the world.
Retrieving
Retrieving comes into play when a search engine user types in a keyword or a string of
keywords. The search engine goes into action retrieving all of the URL’s that it has
stored which are relevant to the keyword and returns this information to the user.
Ranking
Ranking of web pages is essential for the satisfaction of the user’s query. Search
engines rank each web page that they find according to things like trust factors, page
rank and even go as far as considering the user’s search history and where they are
geographically located.
Over 200 factors are considered before providing results to the user.
11. Ranking Factors
Domain Age Link Location on Page
Keyword Appears in Top Level Domain Link Location In Content
Server Location PR of Linking Page
WC3 validation Links from .edu or .gov Domains
Unique Content Nofollow Links
Page Rank Links from Bad Neighborhoods
Quality Links Anchor Text in Links
One Way Link TrustRank
Two Way Link Links from Blog
Three Way Links Do Follow Links
Link Wheel Relevency
14. What is Google Sandbox Effect ?
The Google Sandbox Effect is a theory used to explain
why newly-registered domains or domains with
frequent ownership changes rank poorly
in Google Search Engine Results Pages (SERPS).