SlideShare a Scribd company logo
1
PRESENTATION
ON
GOOGLE
PRESENTEDBY
Sehrish
Akram
2
3
Google, the leading search engine worldwide
Founded in 1998 by Stanford University graduate
students Larry Page and Sergei Brin.
4
5
WHAT IS QUERY
6
SEARCHING TECHNIQUES
Google search engine uses these techniques:
”It is a full-text searching engine”
When we do a Google search actually, we are
searching GOOGLE’s index of the web.
We do this by software program called
“spiders”.
7
SEARCHING TECHNIQUES
Spiders start fetching a few web pages and then
they follow the link and fetch the pages they
point to.
CASE FOLDING technique
Normalized technique e.g.
U.S.A …USA.
8
SEARCHING TECHNIQUES
Case sensitive technique is not also used in
Google if the user search for seven , SEVEN,
Seven or even 7 u get the same results.
Singular is different from plural searches for
apple or apples turn up different pages.
The orders of words matters: Google considers
the first word most important ,the second word
next and so on.
Google ignores most little words including “I”
“an” “ how” “the” “of” “AN”. 9
SEARCHING TECHNIQUES
Google search word limit is 32.
 Wildcards searching generally places the symbol
"*" after a word.
 It tells the database to look for variations of that
word.
For Example: Investigation* Might pull sites
with words such as investigation, investigator,
and investigative.
10
INFORMATION RETRIEVAL AND THE WEB
What We Do
Google WANTED TO organize the web into
something searchable. Their early prototype was
based upon a few basic principles, including:
The best pages tend to be the ones that people
linked to the most.
The best description of a page is often derived
from the anchor text associated with the links to a
page. 11
Anchor text
12
DOCUMENT ACQUISITION AND STORAGE:
Google searches more than 3 billion Web documents,
which includes Web pages, images and Usenet
postings.
Google uses a standalone Web crawler, distributed
trough several machines, to create indexes and copies
of the document.
Besides standard .html files, Google also indexes
other file type including
________
_________
__________
__________
13
DOCUMENT ACQUISITION AND STORAGE:
A copy of each crawled page is stored in
Google’s repository.
Indexes are created using stored words, pointing
to an inverted index file
14
QUERY INTRODUCTION AND USER
OPTIONS:
Since it’s foundation, Google has been steadily
introducing new features.
Google uses Boolean search without nested
expressions support and with some variations.
By default, it automatically uses AND operator
between terms, the minus symbol can be used to
perform a NOT function and the OR operation is
supported (using OR in upper case).
15
Google does not uses stemming, nor truncation,
but allows the use of ‘*’ as a wildcard in the
middle of a phrase. For example, searching for
“Search Engine” wields quite different result
from “Search * Engine”.
Query Introduction and user Options:
16
RESULTS SELECTION AND PRESENTATION
To select which document is presented, Google
combines a document’s Page Rank value, anchor
text and proximity
Results are clustered by server with two visible
results and a link to “More results from server”.
17
RESULTS SELECTION AND PRESENTATION
Google helps users by
correcting misspelled words
in their search queries using,
not a predetermined
dictionary, but it’s own index
of the entire web.
Google visual interface is
one of the simplest and,
according to many, one of the
reasons to Google’s success,
“it’s simple and it works”. 18
LOGICAL DIAGRAM
Web Crawling, Extraction, and Indexing 19

More Related Content

What's hot

Probabilistic retrieval model
Probabilistic retrieval modelProbabilistic retrieval model
Probabilistic retrieval modelbaradhimarch81
 
Digital Library Software
Digital Library SoftwareDigital Library Software
Search Engine
Search EngineSearch Engine
Search Engine
Ankush Srivastava
 
Working Of Search Engine
Working Of Search EngineWorking Of Search Engine
Working Of Search EngineNIKHIL NAIR
 
Information retrieval introduction
Information retrieval introductionInformation retrieval introduction
Information retrieval introduction
nimmyjans4
 
Model of information retrieval (3)
Model  of information retrieval (3)Model  of information retrieval (3)
Model of information retrieval (3)9866825059
 
WebCrawler
WebCrawlerWebCrawler
WebCrawler
mynameismrslide
 
Movie lens recommender systems
Movie lens recommender systemsMovie lens recommender systems
Movie lens recommender systems
Kapil Garg
 
Web Information Retrieval and Mining
Web Information Retrieval and MiningWeb Information Retrieval and Mining
Web Information Retrieval and Mining
Carlos Castillo (ChaTo)
 
Information retrieval 14 fuzzy set models of ir
Information retrieval 14 fuzzy set models of irInformation retrieval 14 fuzzy set models of ir
Information retrieval 14 fuzzy set models of ir
Vaibhav Khanna
 
Vector space model of information retrieval
Vector space model of information retrievalVector space model of information retrieval
Vector space model of information retrievalNanthini Dominique
 
Text mining
Text miningText mining
Text mining
ThejeswiniChivukula
 
Working of search engine
Working of search engineWorking of search engine
Working of search engine
Nikhil Deswal
 
Components of a search engine
Components of a search engineComponents of a search engine
Components of a search engine
Primya Tamil
 
CS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I PPT IN PDF
CS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I  PPT  IN PDFCS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I  PPT  IN PDF
CS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I PPT IN PDF
AALIM MUHAMMED SALEGH COLLEGE OF ENGINEERING
 
5.3 mining sequential patterns
5.3 mining sequential patterns5.3 mining sequential patterns
5.3 mining sequential patterns
Krish_ver2
 
Information retrieval-systems notes
Information retrieval-systems notesInformation retrieval-systems notes
Information retrieval-systems notes
BAIRAVI T
 
Social media mining PPT
Social media mining PPTSocial media mining PPT
Social media mining PPT
Chhavi Mathur
 

What's hot (20)

Probabilistic retrieval model
Probabilistic retrieval modelProbabilistic retrieval model
Probabilistic retrieval model
 
Digital Library Software
Digital Library SoftwareDigital Library Software
Digital Library Software
 
Search Engine
Search EngineSearch Engine
Search Engine
 
Working Of Search Engine
Working Of Search EngineWorking Of Search Engine
Working Of Search Engine
 
Information retrieval introduction
Information retrieval introductionInformation retrieval introduction
Information retrieval introduction
 
Model of information retrieval (3)
Model  of information retrieval (3)Model  of information retrieval (3)
Model of information retrieval (3)
 
WebCrawler
WebCrawlerWebCrawler
WebCrawler
 
Movie lens recommender systems
Movie lens recommender systemsMovie lens recommender systems
Movie lens recommender systems
 
Web Information Retrieval and Mining
Web Information Retrieval and MiningWeb Information Retrieval and Mining
Web Information Retrieval and Mining
 
Information retrieval 14 fuzzy set models of ir
Information retrieval 14 fuzzy set models of irInformation retrieval 14 fuzzy set models of ir
Information retrieval 14 fuzzy set models of ir
 
Vector space model of information retrieval
Vector space model of information retrievalVector space model of information retrieval
Vector space model of information retrieval
 
Web Mining
Web MiningWeb Mining
Web Mining
 
Text mining
Text miningText mining
Text mining
 
Working of search engine
Working of search engineWorking of search engine
Working of search engine
 
Components of a search engine
Components of a search engineComponents of a search engine
Components of a search engine
 
CS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I PPT IN PDF
CS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I  PPT  IN PDFCS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I  PPT  IN PDF
CS8080 INFORMATION RETRIEVAL TECHNIQUES - IRT - UNIT - I PPT IN PDF
 
5.3 mining sequential patterns
5.3 mining sequential patterns5.3 mining sequential patterns
5.3 mining sequential patterns
 
Information retrieval-systems notes
Information retrieval-systems notesInformation retrieval-systems notes
Information retrieval-systems notes
 
web mining
web miningweb mining
web mining
 
Social media mining PPT
Social media mining PPTSocial media mining PPT
Social media mining PPT
 

Viewers also liked

Introduction to Information Retrieval & Models
Introduction to Information Retrieval & ModelsIntroduction to Information Retrieval & Models
Introduction to Information Retrieval & Models
Mounia Lalmas-Roelleke
 
Information storage and retrieval
Information storage and retrievalInformation storage and retrieval
Information storage and retrievalSadaf Rafiq
 
Information retrieval system!
Information retrieval system!Information retrieval system!
Information retrieval system!
Jane Garay
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
Roi Blanco
 
Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)
Kira
 
Information Retrieval
Information RetrievalInformation Retrieval
Storage And Retrieval Of Information
Storage And Retrieval Of InformationStorage And Retrieval Of Information
Storage And Retrieval Of InformationMarcus9000
 
Techniques of information retrieval
Techniques of information retrieval Techniques of information retrieval
Techniques of information retrieval
Tariq Hassan
 
Search Engine Google
Search Engine GoogleSearch Engine Google
Search Engine Google
Chidanand Byahatti
 
Ir 01
Ir   01Ir   01
Google Search Engine
Google Search Engine Google Search Engine
Google Search Engine
Aniket_1415
 
Kno.e.sis Review: late 2012 to mid 2013
Kno.e.sis Review: late 2012 to mid 2013Kno.e.sis Review: late 2012 to mid 2013
Kno.e.sis Review: late 2012 to mid 2013
Artificial Intelligence Institute at UofSC
 
Tdm information retrieval
Tdm information retrievalTdm information retrieval
Tdm information retrievalKU Leuven
 
Knoesis Student Achievement
Knoesis Student AchievementKnoesis Student Achievement
Knoesis Student Achievement
Artificial Intelligence Institute at UofSC
 
Google Search Engine
Google Search EngineGoogle Search Engine
Google Search Engine
guestf460ed0
 
Computers in pharmacy
Computers in pharmacyComputers in pharmacy
Computers in pharmacy
Ashley Toups
 
Search Analytics with ELK (Elastic Stack)
Search Analytics with ELK (Elastic Stack)Search Analytics with ELK (Elastic Stack)
Search Analytics with ELK (Elastic Stack)
MC+A
 
Trust Management: A Tutorial
Trust Management: A TutorialTrust Management: A Tutorial
Trust Management: A Tutorial
Artificial Intelligence Institute at UofSC
 

Viewers also liked (20)

Introduction to Information Retrieval & Models
Introduction to Information Retrieval & ModelsIntroduction to Information Retrieval & Models
Introduction to Information Retrieval & Models
 
Information storage and retrieval
Information storage and retrievalInformation storage and retrieval
Information storage and retrieval
 
Information retrieval system!
Information retrieval system!Information retrieval system!
Information retrieval system!
 
Introduction to Information Retrieval
Introduction to Information RetrievalIntroduction to Information Retrieval
Introduction to Information Retrieval
 
Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)Tutorial 1 (information retrieval basics)
Tutorial 1 (information retrieval basics)
 
Information Retrieval
Information RetrievalInformation Retrieval
Information Retrieval
 
IR
IRIR
IR
 
Storage And Retrieval Of Information
Storage And Retrieval Of InformationStorage And Retrieval Of Information
Storage And Retrieval Of Information
 
Techniques of information retrieval
Techniques of information retrieval Techniques of information retrieval
Techniques of information retrieval
 
Search Engine Google
Search Engine GoogleSearch Engine Google
Search Engine Google
 
Ir 01
Ir   01Ir   01
Ir 01
 
Google Search Engine
Google Search Engine Google Search Engine
Google Search Engine
 
Kno.e.sis Review: late 2012 to mid 2013
Kno.e.sis Review: late 2012 to mid 2013Kno.e.sis Review: late 2012 to mid 2013
Kno.e.sis Review: late 2012 to mid 2013
 
Tdm information retrieval
Tdm information retrievalTdm information retrieval
Tdm information retrieval
 
Knoesis Student Achievement
Knoesis Student AchievementKnoesis Student Achievement
Knoesis Student Achievement
 
Google Search Engine
Google Search EngineGoogle Search Engine
Google Search Engine
 
Computers in pharmacy
Computers in pharmacyComputers in pharmacy
Computers in pharmacy
 
Searching techniques
Searching techniquesSearching techniques
Searching techniques
 
Search Analytics with ELK (Elastic Stack)
Search Analytics with ELK (Elastic Stack)Search Analytics with ELK (Elastic Stack)
Search Analytics with ELK (Elastic Stack)
 
Trust Management: A Tutorial
Trust Management: A TutorialTrust Management: A Tutorial
Trust Management: A Tutorial
 

Similar to Information Retrieval Techniques of Google

Internet search techniques by zakir hossain
Internet search techniques by zakir hossainInternet search techniques by zakir hossain
Internet search techniques by zakir hossain
Zakir Hossain/ICS, Zurich
 
20 great google secrets
20 great google secrets20 great google secrets
20 great google secrets
Shakil Malik
 
Google ppt by amit
Google ppt by amitGoogle ppt by amit
Google ppt by amit
DAVV
 
Inside google search - how it works??
Inside google search - how it works??Inside google search - how it works??
Inside google search - how it works??
Dhruv Patel
 
Mastering google search (i'm using it now)
Mastering google search (i'm using it now)Mastering google search (i'm using it now)
Mastering google search (i'm using it now)
Ye Win
 
Google.com and beyond adjusted
Google.com and beyond adjustedGoogle.com and beyond adjusted
Google.com and beyond adjustedMatsuri Maikami
 
How Google Search Works
How Google Search WorksHow Google Search Works
How Google Search Works
malisetti maneesha
 
How does google search engine work
How does google search engine workHow does google search engine work
How does google search engine work
HamnaGul6
 
The Anatomy of GOOGLE Search Engine
The Anatomy of GOOGLE Search EngineThe Anatomy of GOOGLE Search Engine
The Anatomy of GOOGLE Search Engine
Manish Chopra
 
GOOGLE HUMMINGBIRD ALGORITHM
GOOGLE HUMMINGBIRD ALGORITHMGOOGLE HUMMINGBIRD ALGORITHM
GOOGLE HUMMINGBIRD ALGORITHM
Ashok Sahu
 
Google Searchology
Google SearchologyGoogle Searchology
Google Searchology
Maheshkumar Darji
 
How google works and functions: A complete Approach
How google works and functions: A complete ApproachHow google works and functions: A complete Approach
How google works and functions: A complete ApproachPrakhar Gethe
 
Google - A presentation by Pushpendra Singh Dangi
Google - A presentation by Pushpendra Singh DangiGoogle - A presentation by Pushpendra Singh Dangi
Google - A presentation by Pushpendra Singh Dangi
Pushpendra Singh Dangi
 
20 great google secrets
20 great google secrets20 great google secrets
20 great google secrets
Ranit Mukherjee
 
[Tips] how to use google search efficiently
[Tips] how to use google search efficiently[Tips] how to use google search efficiently
[Tips] how to use google search efficiently
Sanjeev Kumar Jaiswal
 
Google search techniques
Google search techniquesGoogle search techniques
Google search techniques
Nirav Ranpara
 
Google search techniques
Google search techniquesGoogle search techniques
Google search techniquesNirav Ranpara
 
How Google Search Engine Algorithm Works ??
How Google Search Engine Algorithm Works ??How Google Search Engine Algorithm Works ??
How Google Search Engine Algorithm Works ??
Viral Shah
 

Similar to Information Retrieval Techniques of Google (20)

Internet search techniques by zakir hossain
Internet search techniques by zakir hossainInternet search techniques by zakir hossain
Internet search techniques by zakir hossain
 
20 great google secrets
20 great google secrets20 great google secrets
20 great google secrets
 
Google ppt by amit
Google ppt by amitGoogle ppt by amit
Google ppt by amit
 
Google
GoogleGoogle
Google
 
Inside google search - how it works??
Inside google search - how it works??Inside google search - how it works??
Inside google search - how it works??
 
Mastering google search (i'm using it now)
Mastering google search (i'm using it now)Mastering google search (i'm using it now)
Mastering google search (i'm using it now)
 
Google.com and beyond adjusted
Google.com and beyond adjustedGoogle.com and beyond adjusted
Google.com and beyond adjusted
 
How Google Search Works
How Google Search WorksHow Google Search Works
How Google Search Works
 
How does google search engine work
How does google search engine workHow does google search engine work
How does google search engine work
 
The Anatomy of GOOGLE Search Engine
The Anatomy of GOOGLE Search EngineThe Anatomy of GOOGLE Search Engine
The Anatomy of GOOGLE Search Engine
 
GOOGLE HUMMINGBIRD ALGORITHM
GOOGLE HUMMINGBIRD ALGORITHMGOOGLE HUMMINGBIRD ALGORITHM
GOOGLE HUMMINGBIRD ALGORITHM
 
Google Searchology
Google SearchologyGoogle Searchology
Google Searchology
 
How google works and functions: A complete Approach
How google works and functions: A complete ApproachHow google works and functions: A complete Approach
How google works and functions: A complete Approach
 
Google - A presentation by Pushpendra Singh Dangi
Google - A presentation by Pushpendra Singh DangiGoogle - A presentation by Pushpendra Singh Dangi
Google - A presentation by Pushpendra Singh Dangi
 
20 great google secrets
20 great google secrets20 great google secrets
20 great google secrets
 
20 great google secrets
20 great google secrets20 great google secrets
20 great google secrets
 
[Tips] how to use google search efficiently
[Tips] how to use google search efficiently[Tips] how to use google search efficiently
[Tips] how to use google search efficiently
 
Google search techniques
Google search techniquesGoogle search techniques
Google search techniques
 
Google search techniques
Google search techniquesGoogle search techniques
Google search techniques
 
How Google Search Engine Algorithm Works ??
How Google Search Engine Algorithm Works ??How Google Search Engine Algorithm Works ??
How Google Search Engine Algorithm Works ??
 

Recently uploaded

State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 

Recently uploaded (20)

State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 

Information Retrieval Techniques of Google

  • 1. 1
  • 3. 3
  • 4. Google, the leading search engine worldwide Founded in 1998 by Stanford University graduate students Larry Page and Sergei Brin. 4
  • 5. 5
  • 7. SEARCHING TECHNIQUES Google search engine uses these techniques: ”It is a full-text searching engine” When we do a Google search actually, we are searching GOOGLE’s index of the web. We do this by software program called “spiders”. 7
  • 8. SEARCHING TECHNIQUES Spiders start fetching a few web pages and then they follow the link and fetch the pages they point to. CASE FOLDING technique Normalized technique e.g. U.S.A …USA. 8
  • 9. SEARCHING TECHNIQUES Case sensitive technique is not also used in Google if the user search for seven , SEVEN, Seven or even 7 u get the same results. Singular is different from plural searches for apple or apples turn up different pages. The orders of words matters: Google considers the first word most important ,the second word next and so on. Google ignores most little words including “I” “an” “ how” “the” “of” “AN”. 9
  • 10. SEARCHING TECHNIQUES Google search word limit is 32.  Wildcards searching generally places the symbol "*" after a word.  It tells the database to look for variations of that word. For Example: Investigation* Might pull sites with words such as investigation, investigator, and investigative. 10
  • 11. INFORMATION RETRIEVAL AND THE WEB What We Do Google WANTED TO organize the web into something searchable. Their early prototype was based upon a few basic principles, including: The best pages tend to be the ones that people linked to the most. The best description of a page is often derived from the anchor text associated with the links to a page. 11
  • 13. DOCUMENT ACQUISITION AND STORAGE: Google searches more than 3 billion Web documents, which includes Web pages, images and Usenet postings. Google uses a standalone Web crawler, distributed trough several machines, to create indexes and copies of the document. Besides standard .html files, Google also indexes other file type including ________ _________ __________ __________ 13
  • 14. DOCUMENT ACQUISITION AND STORAGE: A copy of each crawled page is stored in Google’s repository. Indexes are created using stored words, pointing to an inverted index file 14
  • 15. QUERY INTRODUCTION AND USER OPTIONS: Since it’s foundation, Google has been steadily introducing new features. Google uses Boolean search without nested expressions support and with some variations. By default, it automatically uses AND operator between terms, the minus symbol can be used to perform a NOT function and the OR operation is supported (using OR in upper case). 15
  • 16. Google does not uses stemming, nor truncation, but allows the use of ‘*’ as a wildcard in the middle of a phrase. For example, searching for “Search Engine” wields quite different result from “Search * Engine”. Query Introduction and user Options: 16
  • 17. RESULTS SELECTION AND PRESENTATION To select which document is presented, Google combines a document’s Page Rank value, anchor text and proximity Results are clustered by server with two visible results and a link to “More results from server”. 17
  • 18. RESULTS SELECTION AND PRESENTATION Google helps users by correcting misspelled words in their search queries using, not a predetermined dictionary, but it’s own index of the entire web. Google visual interface is one of the simplest and, according to many, one of the reasons to Google’s success, “it’s simple and it works”. 18
  • 19. LOGICAL DIAGRAM Web Crawling, Extraction, and Indexing 19