SlideShare a Scribd company logo
1 of 18
Motivation When searching for information on the WWW, user perform a query to a search engine. The engine return, as the query’s result, a list of Web sites which usually is a huge set. So the ranking of these web sites is very important. Because much information is contained in the link-structure of the WWW, information such as which pages are linked to others can be used to augment search algorithms.
[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object]
SALSA----Idea SALSA is based upon the theory of Markov chains,  and relies on the stochastic properties of random walks  performed on our collection of sites. The input to our scheme consists of a collection of sites  C  which is built around a topic  t . Intuition  suggests that authoritative sites on topic  t  should be visible from many sites in the subgraph induced by  C .  Thus, a random walk on this subgraph will visit t -authorities with high probability.
SALSA----Idea Combine the theory of random walks with the notion  of the two distinct types of Web sites, hubs and  authorities, and actually analyze two different Markov  chains: A chain of hubs and a chain of authorities.  Analyzing both chains allows our approach to give each Web site two distinct scores, a hub score and an  authority score.
[object Object],[object Object],[object Object],[object Object],[object Object]
SALSA the principal community of authorities(hubs) found by the SALSA will be composed of the sites whose entries in the principal eigenvector of  A  ( H ) are the highest.
SALSA----Conclusion SALSA is a new stochastic approach for link structure analysis, which examines random walks on graphs derived from the link structure.  The principal community of authorities(hubs) corresponds to the sites that are most frequently visited by the random walk defined by the authority(hub) Markov chain.
The PageRank Citation Ranking: Bringing Order to the Web Larry Page etc. Stanford University
PageRank----Idea Every page has some number of forward links(outedges) and backlinks(inedges)
PageRank----Idea ,[object Object],[object Object]
PageRank----Idea ,[object Object],A page has high rank if the sum of the ranks of its backlinks is high. This covers both the case when a page has many backlinks and when a page has a few highly ranked backlinks.
PageRank----Definition u: a web page F u :  set of pages u points to  B u :  set of pages that point to u N u =|F u |:  the number of links from u  c: a factor used for normalization The equation is recursive, but it may be computed by starting with any set of ranks and iterating the computation until it converges.
PageRank----definition A problem with above definition:  rank sink If two web pages point to each other but to no other page, during the iteration, this loop will accumulate rank but  never distribute any rank.
PageRank----definition Definition modified: E(u) is some vector over the web pages(for example uniform, favorite page etc.) that corresponds to a source of rank.  E(u) is a user designed parameter.
PageRank----Random Surfer Model ,[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object]

More Related Content

Viewers also liked (8)

Ona Eraikuntza Iii 09.12.14
Ona Eraikuntza Iii 09.12.14Ona Eraikuntza Iii 09.12.14
Ona Eraikuntza Iii 09.12.14
 
13. TUGASAN 4.
13. TUGASAN 4.13. TUGASAN 4.
13. TUGASAN 4.
 
091112 R T Andiarena Aranburu
091112 R T Andiarena Aranburu091112 R T Andiarena Aranburu
091112 R T Andiarena Aranburu
 
Rett Disorder Syndrome
Rett Disorder SyndromeRett Disorder Syndrome
Rett Disorder Syndrome
 
Tara Public Presentation
Tara Public PresentationTara Public Presentation
Tara Public Presentation
 
ondare industriala villabonan
ondare industriala villabonanondare industriala villabonan
ondare industriala villabonan
 
Kajian Tindakan Dalam Pendidikan Upload
Kajian Tindakan Dalam Pendidikan UploadKajian Tindakan Dalam Pendidikan Upload
Kajian Tindakan Dalam Pendidikan Upload
 
Hubungan Antara Tahap Kesediaan Pembelajaran Arahan Kendiri Dengan Tahap Peng...
Hubungan Antara Tahap Kesediaan Pembelajaran Arahan Kendiri Dengan Tahap Peng...Hubungan Antara Tahap Kesediaan Pembelajaran Arahan Kendiri Dengan Tahap Peng...
Hubungan Antara Tahap Kesediaan Pembelajaran Arahan Kendiri Dengan Tahap Peng...
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

Pagerank

  • 1. Motivation When searching for information on the WWW, user perform a query to a search engine. The engine return, as the query’s result, a list of Web sites which usually is a huge set. So the ranking of these web sites is very important. Because much information is contained in the link-structure of the WWW, information such as which pages are linked to others can be used to augment search algorithms.
  • 2.
  • 3.
  • 4. SALSA----Idea SALSA is based upon the theory of Markov chains, and relies on the stochastic properties of random walks performed on our collection of sites. The input to our scheme consists of a collection of sites C which is built around a topic t . Intuition suggests that authoritative sites on topic t should be visible from many sites in the subgraph induced by C . Thus, a random walk on this subgraph will visit t -authorities with high probability.
  • 5. SALSA----Idea Combine the theory of random walks with the notion of the two distinct types of Web sites, hubs and authorities, and actually analyze two different Markov chains: A chain of hubs and a chain of authorities. Analyzing both chains allows our approach to give each Web site two distinct scores, a hub score and an authority score.
  • 6.
  • 7. SALSA the principal community of authorities(hubs) found by the SALSA will be composed of the sites whose entries in the principal eigenvector of A ( H ) are the highest.
  • 8. SALSA----Conclusion SALSA is a new stochastic approach for link structure analysis, which examines random walks on graphs derived from the link structure. The principal community of authorities(hubs) corresponds to the sites that are most frequently visited by the random walk defined by the authority(hub) Markov chain.
  • 9. The PageRank Citation Ranking: Bringing Order to the Web Larry Page etc. Stanford University
  • 10. PageRank----Idea Every page has some number of forward links(outedges) and backlinks(inedges)
  • 11.
  • 12.
  • 13. PageRank----Definition u: a web page F u : set of pages u points to B u : set of pages that point to u N u =|F u |: the number of links from u c: a factor used for normalization The equation is recursive, but it may be computed by starting with any set of ranks and iterating the computation until it converges.
  • 14. PageRank----definition A problem with above definition: rank sink If two web pages point to each other but to no other page, during the iteration, this loop will accumulate rank but never distribute any rank.
  • 15. PageRank----definition Definition modified: E(u) is some vector over the web pages(for example uniform, favorite page etc.) that corresponds to a source of rank. E(u) is a user designed parameter.
  • 16.
  • 17.
  • 18.