This document provides an overview of approaches for human activity recognition (HAR). It discusses vision-based approaches including using hand-crafted motion features, depth information methods, and deep learning techniques. Deep learning methods covered include two-stream inflated 3D networks, deep bidirectional LSTM with CNN features, and skeleton-based spatial temporal graph convolutional networks. The document also discusses potential future directions for HAR, which may involve using 3D features, unsupervised learning, skeleton-based methods for privacy, and optimization/quantization. HAR applications in the future could leverage the power of modern AI and embedded abilities while meeting requirements like real-time performance, privacy, and ease of deployment.