This document presents an algorithm for keyword extraction and document clustering using statistical approaches and an agglomerative method. The algorithm consists of two parts: the extraction of significant keywords through statistical analysis and clustering the documents based on those keywords, achieving over 90% accuracy. The methodology includes the preprocessing of documents, keyword extraction using various statistical methods, and hierarchical clustering to group similar documents.