This paper reviews and compares traditional handcrafted techniques and deep learning approaches in sport video analysis, emphasizing their effectiveness in human activity recognition. It highlights the limitations of traditional methods, particularly their inability to generalize across datasets and their focus on low-level feature extraction. The deep learning methods, including CNN and LSTM, show promise in capturing high-level semantics and temporal information, though the field still requires further exploration and research.