The document presents a comparative analysis of web document clustering algorithms to determine the most efficient method for retrieving information from large databases on the web. It evaluates several algorithms, including Suffix Tree Clustering (STC), K-means, Agglomerative Hierarchical Clustering (AHC), Fractionation, and Semantic Hierarchical Online Clustering (SHOC), based on parameters of retrieval time and document relevancy. The findings suggest that the STC algorithm performs best in terms of maximizing retrieved documents while minimizing processing time.