SlideShare a Scribd company logo
GRAPH THEORY IN
SEARCH ENGINES
Preethi Satsangi
2102818
BSc.CS
Understanding the web begins with visualising it as a directed graph, a
concept where pages are nodes connected by links that have direction.
Vertices and Edges
•Vertices (Nodes): Each webpage is a vertex in the graph. The size of
the vertex set can be used to gauge the scale of a web graph.
•Edges (Links): Hyperlinks that direct from one page to another form the
edges. The direction of an edge indicates the flow of navigation from one
page to the next.
Directed Graphs
•Unidirectional Flow: Unlike undirected graphs, where edges have no
orientation, the web graph's edges are directed, reflecting the one-way
nature of hyperlinks.
•Cycles: Cycles occur when a path of directed edges forms a loop, which
is significant in understanding web navigation patterns.
Web Crawling
This is the process by which search engines visit and index
webpages, interpreting the web graph to build a database of
information.
•Crawlers (Spiders): These bots systematically visit webpages,
tracing the graph's edges to discover new vertices.
•Indexing Strategies: Different strategies determine the order in
which pages are visited and indexed, impacting the freshness and
comprehensiveness of the search results.
PageRank Algorithm
PageRank is foundational to Google's search technology, applying
graph theory to rank web pages. Here is an overview of how the
PageRank algorithm works:
•Link Analysis: The algorithm treats links as votes, with the idea that
pages linked by important pages are themselves likely to be important.
•Iterative Calculation: PageRank iteratively calculates the importance
of each page based on the importance of the pages that link to it.
•Damping Factor: Typically set around 0.85, this factor accounts for
the probability that a user may stop following links and start a new
search.
Conclusion
In sum, graph theory is integral to understanding and navigating the
web. By applying graph theory to the web,we gain valuable insights into
the structure and dynamics of online information. It is a powerful tool that
continues to shape the way we access and analyse data on the internet,
making it an indispensable subject for those looking to delve deeper into
the field of computer science.
THANK YOU

More Related Content

Similar to Graph theory in Search engines and web connectivity.pptx

Mining web-logs-to-improve-website-organization1
Mining web-logs-to-improve-website-organization1Mining web-logs-to-improve-website-organization1
Mining web-logs-to-improve-website-organization1
Ijcem Journal
 
Aggregate rank bringing order to web sites
Aggregate rank  bringing order to web sitesAggregate rank  bringing order to web sites
Aggregate rank bringing order to web sites
OUM SAOKOSAL
 
Page rank algortihm
Page rank algortihmPage rank algortihm
Page rank algortihm
Siddharth Kar
 

Similar to Graph theory in Search engines and web connectivity.pptx (20)

Incremental Page Rank Computation on Evolving Graphs : NOTES
Incremental Page Rank Computation on Evolving Graphs : NOTESIncremental Page Rank Computation on Evolving Graphs : NOTES
Incremental Page Rank Computation on Evolving Graphs : NOTES
 
Web mining (structure mining)
Web mining (structure mining)Web mining (structure mining)
Web mining (structure mining)
 
Mining web-logs-to-improve-website-organization1
Mining web-logs-to-improve-website-organization1Mining web-logs-to-improve-website-organization1
Mining web-logs-to-improve-website-organization1
 
PageRank Algorithm
PageRank AlgorithmPageRank Algorithm
PageRank Algorithm
 
Comparative study of different ranking algorithms adopted by search engine
Comparative study of  different ranking algorithms adopted by search engineComparative study of  different ranking algorithms adopted by search engine
Comparative study of different ranking algorithms adopted by search engine
 
Web Mining.pptx
Web Mining.pptxWeb Mining.pptx
Web Mining.pptx
 
A Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTESA Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTES
 
Aggregate rank bringing order to web sites
Aggregate rank  bringing order to web sitesAggregate rank  bringing order to web sites
Aggregate rank bringing order to web sites
 
Page Rank
Page RankPage Rank
Page Rank
 
Enhancement in Weighted PageRank Algorithm Using VOL
Enhancement in Weighted PageRank Algorithm Using VOLEnhancement in Weighted PageRank Algorithm Using VOL
Enhancement in Weighted PageRank Algorithm Using VOL
 
Link analysis for web search
Link analysis for web searchLink analysis for web search
Link analysis for web search
 
Web crawling
Web crawlingWeb crawling
Web crawling
 
PageRank algorithm and its variations: A Survey report
PageRank algorithm and its variations: A Survey reportPageRank algorithm and its variations: A Survey report
PageRank algorithm and its variations: A Survey report
 
“Web crawler”
“Web crawler”“Web crawler”
“Web crawler”
 
Dm page rank
Dm page rankDm page rank
Dm page rank
 
Web mining
Web miningWeb mining
Web mining
 
Q3
Q3Q3
Q3
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithm
 
Page rank algortihm
Page rank algortihmPage rank algortihm
Page rank algortihm
 
[LvDuit//Lab] Crawling the web
[LvDuit//Lab] Crawling the web[LvDuit//Lab] Crawling the web
[LvDuit//Lab] Crawling the web
 

Recently uploaded

Production 2024 sunderland culture final - Copy.pptx
Production 2024 sunderland culture final - Copy.pptxProduction 2024 sunderland culture final - Copy.pptx
Production 2024 sunderland culture final - Copy.pptx
ChloeMeadows1
 
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkkaudience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
lolsDocherty
 
Article writing on excessive use of internet.pptx
Article writing on excessive use of internet.pptxArticle writing on excessive use of internet.pptx
Article writing on excessive use of internet.pptx
abhinandnam9997
 
一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理
一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理
一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理
aagad
 

Recently uploaded (14)

How Do I Begin the Linksys Velop Setup Process?
How Do I Begin the Linksys Velop Setup Process?How Do I Begin the Linksys Velop Setup Process?
How Do I Begin the Linksys Velop Setup Process?
 
Production 2024 sunderland culture final - Copy.pptx
Production 2024 sunderland culture final - Copy.pptxProduction 2024 sunderland culture final - Copy.pptx
Production 2024 sunderland culture final - Copy.pptx
 
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkkaudience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
 
Case study on merger of Vodafone and Idea (VI).pptx
Case study on merger of Vodafone and Idea (VI).pptxCase study on merger of Vodafone and Idea (VI).pptx
Case study on merger of Vodafone and Idea (VI).pptx
 
Pvtaan Social media marketing proposal.pdf
Pvtaan Social media marketing proposal.pdfPvtaan Social media marketing proposal.pdf
Pvtaan Social media marketing proposal.pdf
 
Article writing on excessive use of internet.pptx
Article writing on excessive use of internet.pptxArticle writing on excessive use of internet.pptx
Article writing on excessive use of internet.pptx
 
Premier Mobile App Development Agency in USA.pdf
Premier Mobile App Development Agency in USA.pdfPremier Mobile App Development Agency in USA.pdf
Premier Mobile App Development Agency in USA.pdf
 
Statistical Analysis of DNS Latencies.pdf
Statistical Analysis of DNS Latencies.pdfStatistical Analysis of DNS Latencies.pdf
Statistical Analysis of DNS Latencies.pdf
 
The Use of AI in Indonesia Election 2024: A Case Study
The Use of AI in Indonesia Election 2024: A Case StudyThe Use of AI in Indonesia Election 2024: A Case Study
The Use of AI in Indonesia Election 2024: A Case Study
 
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesMulti-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
 
Bug Bounty Blueprint : A Beginner's Guide
Bug Bounty Blueprint : A Beginner's GuideBug Bounty Blueprint : A Beginner's Guide
Bug Bounty Blueprint : A Beginner's Guide
 
Cyber Security Services Unveiled: Strategies to Secure Your Digital Presence
Cyber Security Services Unveiled: Strategies to Secure Your Digital PresenceCyber Security Services Unveiled: Strategies to Secure Your Digital Presence
Cyber Security Services Unveiled: Strategies to Secure Your Digital Presence
 
ER(Entity Relationship) Diagram for online shopping - TAE
ER(Entity Relationship) Diagram for online shopping - TAEER(Entity Relationship) Diagram for online shopping - TAE
ER(Entity Relationship) Diagram for online shopping - TAE
 
一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理
一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理
一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理
 

Graph theory in Search engines and web connectivity.pptx

  • 1. GRAPH THEORY IN SEARCH ENGINES Preethi Satsangi 2102818 BSc.CS
  • 2. Understanding the web begins with visualising it as a directed graph, a concept where pages are nodes connected by links that have direction. Vertices and Edges •Vertices (Nodes): Each webpage is a vertex in the graph. The size of the vertex set can be used to gauge the scale of a web graph. •Edges (Links): Hyperlinks that direct from one page to another form the edges. The direction of an edge indicates the flow of navigation from one page to the next. Directed Graphs •Unidirectional Flow: Unlike undirected graphs, where edges have no orientation, the web graph's edges are directed, reflecting the one-way nature of hyperlinks. •Cycles: Cycles occur when a path of directed edges forms a loop, which is significant in understanding web navigation patterns.
  • 3. Web Crawling This is the process by which search engines visit and index webpages, interpreting the web graph to build a database of information. •Crawlers (Spiders): These bots systematically visit webpages, tracing the graph's edges to discover new vertices. •Indexing Strategies: Different strategies determine the order in which pages are visited and indexed, impacting the freshness and comprehensiveness of the search results.
  • 4. PageRank Algorithm PageRank is foundational to Google's search technology, applying graph theory to rank web pages. Here is an overview of how the PageRank algorithm works: •Link Analysis: The algorithm treats links as votes, with the idea that pages linked by important pages are themselves likely to be important. •Iterative Calculation: PageRank iteratively calculates the importance of each page based on the importance of the pages that link to it. •Damping Factor: Typically set around 0.85, this factor accounts for the probability that a user may stop following links and start a new search.
  • 5. Conclusion In sum, graph theory is integral to understanding and navigating the web. By applying graph theory to the web,we gain valuable insights into the structure and dynamics of online information. It is a powerful tool that continues to shape the way we access and analyse data on the internet, making it an indispensable subject for those looking to delve deeper into the field of computer science.

Editor's Notes

  1. Search engines are the tools that navigate the web graph, bringing structure to the chaos of the internet.