This document discusses clustering algorithms for large datasets that do not fit into main memory. It introduces the Relational K-Means (RKM) algorithm, which limits disk I/O by assigning data points in batches and updating cluster centroids after only 3 iterations. RKM stores cluster assignment and centroid data in matrices on disk and minimizes I/O by accessing matrix rows sequentially. An evaluation shows RKM outperforms standard K-means on large datasets due to its ability to handle data that does not fit in memory through efficient disk access. However, RKM does not address all limitations of K-means clustering.