This document discusses two papers on neural networks for video understanding. The first proposes the non-local neural network, which uses a pairwise function between positions to model long-range dependencies. The second presents TVNet, which learns motion representations for video understanding tasks end to end, building on the TV-L1 optical flow algorithm. Both methods are evaluated experimentally.
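To make the idea of a pairwise function over all positions concrete, here is a minimal NumPy sketch of a non-local operation. It assumes a softmax-normalized dot product as the pairwise function, which is only one possible choice. The function name `non_local`, the projection matrices `w_theta`, `w_phi`, `w_g`, and the flattened `(positions, channels)` layout are illustrative assumptions, not details taken from the slides.

```python
import numpy as np

def non_local(x, w_theta, w_phi, w_g):
    """Sketch of a non-local operation over N positions.

    x: (N, C) array, one feature vector per position (space and/or time).
    w_theta, w_phi, w_g: (C, D) projection matrices (hypothetical names).
    Returns the (N, D) output and the (N, N) pairwise weights.
    """
    theta = x @ w_theta            # queries, (N, D)
    phi = x @ w_phi                # keys, (N, D)
    g = x @ w_g                    # values, (N, D)

    # Pairwise function: similarity of every position i with every position j.
    scores = theta @ phi.T         # (N, N)

    # Softmax over j (assumed normalization), shifted for numerical stability.
    scores = scores - scores.max(axis=1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=1, keepdims=True)

    # Each output position is a weighted sum over all positions,
    # so distant positions can contribute directly.
    return weights @ g, weights

rng = np.random.default_rng(0)
x = rng.normal(size=(6, 4))        # 6 positions, 4 channels
w_theta, w_phi, w_g = (rng.normal(size=(4, 3)) for _ in range(3))
y, weights = non_local(x, w_theta, w_phi, w_g)
print(y.shape, weights.shape)
```

Because the weights are computed between every pair of positions, the response at one position can depend on any other position regardless of distance, which is the long-range behavior the summary refers to. A full block would typically wrap this in further learned layers, which the sketch omits.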












