Step
1
Database Module
• A graph of nodes and edges is constructed from the links that point to other web pages.
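The link graph described above can be sketched as an adjacency map. This is a minimal illustration, not the deck's actual implementation: the function name `build_link_graph`, the `crawled` input, and the `*.example` page names are all hypothetical.

```python
from collections import defaultdict

def build_link_graph(pages):
    """Build a directed graph: each page is a node, each link is an edge."""
    graph = defaultdict(set)
    for url, outlinks in pages.items():
        graph[url]  # touch the key so pages without outlinks still become nodes
        for target in outlinks:
            if target != url:  # ignore links from a page to itself
                graph[url].add(target)
                graph[target]  # register the link target as a node too
    # Return plain sorted lists so the result is easy to read and compare.
    return {node: sorted(edges) for node, edges in graph.items()}

# Hypothetical crawl result: page -> links found on that page.
crawled = {
    "a.example": ["b.example", "c.example"],
    "b.example": ["c.example"],
    "c.example": ["a.example"],
}
link_graph = build_link_graph(crawled)
```

A set per node removes duplicate links, so a page that links to the same target several times still contributes a single edge.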
Bots pass the full text of each web page to the indexer.
Stop words (such as "for", "in", and "at") and punctuation are ignored.
The text is converted to lowercase and stored.
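The three indexing steps above can be sketched in a few lines. The stop-word list here is a small assumed sample (the slide only names "for", "in", and "at"), and the function name `normalize` is hypothetical.

```python
import re

# Assumed sample list; real indexers use much longer stop-word lists.
STOP_WORDS = {"for", "in", "at", "the", "a", "an", "of", "and", "to", "is"}

def normalize(text):
    """Lowercase the text, drop punctuation, and remove stop words."""
    lowered = text.lower()
    # Keep only runs of letters and digits, which discards punctuation.
    words = re.findall(r"[a-z0-9]+", lowered)
    return [word for word in words if word not in STOP_WORDS]

terms = normalize("Search engines look for Links, in pages, at scale!")
# terms == ["search", "engines", "look", "links", "pages", "scale"]
```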
Term Weighting Factor
Term Frequency: how many times the term occurs in the collected text.
Collection Frequency: used to discriminate one document from another.
Length Normalization: long documents have larger term sets than short ones, so term weights are scaled by document length.
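The three weighting factors above can be combined in one small function. This sketch assumes inverse document frequency as the collection-frequency factor and cosine (unit-length) normalization as the length normalization. Both are common choices, but the slides do not say which formulas are meant. The function name `tf_idf` and the sample documents are hypothetical.

```python
import math
from collections import Counter

def tf_idf(documents):
    """Return one {term: weight} dict per document (a list of term lists)."""
    n_docs = len(documents)
    # Collection frequency: in how many documents does each term appear?
    doc_freq = Counter()
    for terms in documents:
        doc_freq.update(set(terms))

    weights = []
    for terms in documents:
        counts = Counter(terms)  # term frequency within this document
        raw = {t: c * math.log(n_docs / doc_freq[t]) for t, c in counts.items()}
        # Length normalization: scale each document vector to unit length.
        norm = math.sqrt(sum(w * w for w in raw.values()))
        weights.append({t: (w / norm if norm else 0.0) for t, w in raw.items()})
    return weights

docs = [
    ["search", "engine", "index"],
    ["search", "crawler", "crawler"],
    ["search", "ranking"],
]
weights = tf_idf(docs)
```

In this example "search" appears in every document, so its weight is zero everywhere: a term that occurs in all documents cannot discriminate one document from another.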
It is not possible to keep up with the growth of the web and update indexed content immediately.
By the time a bot has crawled through the web, the content it indexed is already outdated.
The web is therefore divided into segments, and the index is updated incrementally, one segment at a time.
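One way to picture segment-by-segment updating is an index that stores each segment separately, so that re-crawling one segment replaces only that part. This is a toy sketch under that assumption. The class `SegmentedIndex`, its methods, and the segment names are hypothetical, not taken from any real search engine.

```python
from collections import defaultdict

class SegmentedIndex:
    """Toy index that is refreshed one web segment at a time."""

    def __init__(self):
        # segment name -> {url: set of terms on that page}
        self.segments = defaultdict(dict)

    def refresh_segment(self, segment, crawled_pages):
        """Replace one segment's pages; all other segments stay untouched."""
        self.segments[segment] = {
            url: set(terms) for url, terms in crawled_pages.items()
        }

    def search(self, term):
        """Return the URLs, across all segments, whose page contains the term."""
        return sorted(
            url
            for pages in self.segments.values()
            for url, terms in pages.items()
            if term in terms
        )

index = SegmentedIndex()
index.refresh_segment("news", {"n1": ["election", "results"]})
index.refresh_segment("docs", {"d1": ["python", "results"]})
before = index.search("results")

# Only the "news" segment is re-crawled; "docs" keeps its older content.
index.refresh_segment("news", {"n1": ["weather"]})
after = index.search("results")
```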
Page Rank – Google’s Secret Algorithm
Diagram labels: Latest, Reputation, Popularity, Authority, Trustworthy, Freshness, Relevance
Query Terms: Position, Size, Proximity
User: Content, Geographic Region, Web History
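For the popularity part of ranking, the published PageRank idea can be sketched as a power iteration over the link graph. This is the textbook formulation with the commonly cited damping factor of 0.85, not Google's production ranking, which combines many more signals such as those listed above. The function name `pagerank` and the three-page graph are hypothetical, and the sketch assumes every link target also appears as a key in the graph.

```python
def pagerank(graph, damping=0.85, iterations=50):
    """Iteratively estimate PageRank for a graph given as {page: [outlinks]}."""
    nodes = list(graph)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}  # start from a uniform distribution
    for _ in range(iterations):
        # Every page receives a small baseline share of rank.
        new_rank = {node: (1.0 - damping) / n for node in nodes}
        for node, outlinks in graph.items():
            if outlinks:
                # A page splits its damped rank equally among its outlinks.
                share = damping * rank[node] / len(outlinks)
                for target in outlinks:
                    new_rank[target] += share
            else:
                # A page without outlinks spreads its rank over all pages.
                for target in nodes:
                    new_rank[target] += damping * rank[node] / n
        rank = new_rank
    return rank

graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"]}
ranks = pagerank(graph)
```

In this graph, page "c" is linked from both "a" and "b", so it ends up ranked above "b", which is linked only from "a".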
The algorithms look for the deeper meaning of the words you type in the search bar.
A search engine identifies possible spelling errors, corrects them, and offers alternatives.
Autocomplete predicts what you might be searching for. This includes understanding terms that have more than one meaning.
Previous searches help the engine work out what the user is likely to be looking for.
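Spelling suggestions and autocomplete can be approximated with the Python standard library. This sketch assumes a tiny fixed vocabulary and a list of past queries. Real engines learn both from very large query logs, and the names `VOCABULARY`, `suggest_spelling`, and `autocomplete` are hypothetical.

```python
import difflib
from collections import Counter

# Assumed sample vocabulary of known, correctly spelled terms.
VOCABULARY = ["search", "engine", "index", "crawler", "ranking", "python"]

def suggest_spelling(word, vocabulary=VOCABULARY):
    """Return up to three known words that closely resemble the typed word."""
    return difflib.get_close_matches(word.lower(), vocabulary, n=3, cutoff=0.7)

def autocomplete(prefix, past_queries, limit=3):
    """Complete a prefix from earlier queries, most frequent first."""
    counts = Counter(query.lower() for query in past_queries)
    matches = [query for query in counts if query.startswith(prefix.lower())]
    # Sort by descending frequency, then alphabetically for stable output.
    return sorted(matches, key=lambda query: (-counts[query], query))[:limit]

history = ["python tutorial", "python snake", "python tutorial", "pytest docs"]
corrections = suggest_spelling("serch")
completions = autocomplete("pyt", history)
```

Ranking completions by how often a query was made before is one simple way in which previous searches can shape what the engine proposes.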
A lot more goes into displaying the most relevant results to the user.
Search engines like Google rank results based on more than 200 factors.
How a Search Engine Works