The document proposes a computationally efficient method for calculating the Maximum Mean Discrepancy (MMD) between two distributions based on random Fourier features. It shows that by using random features to approximate the kernel, the MMD can be computed in O(BM^2 + B^2) time, where B is the batch size and M is the number of random features, providing a significant speedup over direct kernel computation which takes O(B^3) time. It applies this efficient MMD to the task of voice conversion and demonstrates it achieves better voice quality and naturalness compared to other baselines, as measured by objective metrics and a mean opinion score user study.