This document proposes an efficient privacy-preserving clustering-based multi-keyword search system. It uses hierarchical clustering to generate clusters of encrypted documents in the cloud. The system aims to improve search efficiency while maintaining security. It utilizes EM clustering, SHA-1 hashing for deduplication, and a user revocation method. Experimental results show the framework has advantages such as efficient memory and time utilization, secure search over encrypted data, secure data storage, and deduplication.