This paper presents a novel approach for clustering big data using the k-prototype algorithm implemented on the MapReduce framework. The approach addresses the limitations of traditional clustering methods like k-means, which only works efficiently with numerical data, by allowing for the combination of numerical and categorical data. Experiments show that the proposed method significantly improves performance and reduces execution time when handling large-scale data compared to traditional algorithms.