This paper presents a method for detecting whether individuals are wearing masks using deep learning models, specifically a two-stage CNN architecture with MobileNetV2 and VGG16. The system is trained on a dataset of images of masked and unmasked faces, achieving high accuracy rates for real-time detection in crowded environments. Future enhancements include improving detection of improper mask usage and expanding the classification categories.