The document reviews human activity recognition (HAR) systems, emphasizing the challenges in accurately identifying human activities due to factors like cluttered backgrounds and different viewpoints. It proposes a low-computational approach using the YOLO architecture for real-time recognition, enhancing accuracy by pre-processing video inputs and classifying activities through a convolutional neural network. The proposed method aims to optimize performance in various applications, including surveillance, while maintaining operational efficiency.