This paper reviews the design of a file system architecture that employs cluster formation and query optimization for efficient data retrieval from large data sources. It compares the MapReduce paradigm with parallel database management systems (DBMS) and highlights the advantages of adopting clustering in query optimization to enhance performance. The research indicates that parallel DBMS outperforms MapReduce in execution times and energy efficiency while also discussing future implications for file management systems in enterprises.