This document discusses gesture recognition technology. It begins with an overview of current gesture recognition devices and the hardware they rely on, such as depth cameras. It then turns to the challenges of localizing hands in images, recovering the coordinates of the joints, and reconstructing the hand skeleton. These challenges map onto three main tasks: hand localization, regression of joint coordinates using convolutional neural networks, and skeleton inference to recover the hand pose. The goal is accurate, real-time gesture recognition on standard equipment.
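
The three tasks can be read as a pipeline in which each stage feeds the next. The sketch below is only an illustration of that data flow, not the method the document describes. It assumes a depth image in millimetres with 0 marking invalid pixels, a hypothetical 21-joint hand layout (one wrist joint plus four joints per finger), and a placeholder `dummy_regressor` standing in for a trained convolutional network. All function names and parameters are made up for the example.

```python
import numpy as np

# Assumed layout: joint 0 is the wrist, joints 1-20 are four per finger.
NUM_JOINTS = 21
# Parent of each joint; the first joint of every finger attaches to the wrist.
PARENTS = [-1] + [0 if j % 4 == 1 else j - 1 for j in range(1, NUM_JOINTS)]


def localize_hand(depth, band_mm=150.0):
    """Task 1: bounding box of the region closest to the camera.

    Assumes the hand is the nearest object; returns (top, bottom, left, right)
    with exclusive bottom/right, or None if no valid depth is present.
    """
    valid = depth > 0
    if not valid.any():
        return None
    nearest = depth[valid].min()
    mask = valid & (depth <= nearest + band_mm)
    rows = np.flatnonzero(mask.any(axis=1))
    cols = np.flatnonzero(mask.any(axis=0))
    return int(rows[0]), int(rows[-1]) + 1, int(cols[0]), int(cols[-1]) + 1


def dummy_regressor(crop):
    """Task 2 placeholder: a real system would run a CNN on the crop here.

    Returns (NUM_JOINTS, 2) joint positions as (x, y) normalized to [0, 1]
    within the crop. The values are arbitrary, not a prediction.
    """
    t = np.linspace(0.1, 0.9, NUM_JOINTS)
    return np.stack([t, t], axis=1)


def infer_skeleton(joints_norm, box):
    """Task 3: map joints back to image pixels and connect them into bones."""
    top, bottom, left, right = box
    scale = np.array([right - left, bottom - top], dtype=float)
    offset = np.array([left, top], dtype=float)
    joints_px = joints_norm * scale + offset  # (x, y) in image coordinates
    bones = [(p, j) for j, p in enumerate(PARENTS) if p >= 0]
    return joints_px, bones


def estimate_pose(depth, regressor=dummy_regressor):
    """Run the three stages in order; returns None if no hand is found."""
    box = localize_hand(depth)
    if box is None:
        return None
    top, bottom, left, right = box
    crop = depth[top:bottom, left:right]
    joints_px, bones = infer_skeleton(regressor(crop), box)
    return box, joints_px, bones


# Synthetic frame: background at 1000 mm, a "hand" patch at 400 mm.
frame = np.full((100, 120), 1000.0)
frame[20:60, 30:70] = 400.0
box, joints_px, bones = estimate_pose(frame)
```

Passing the regressor as an argument keeps localization and skeleton inference independent of the network, so a trained model could replace the placeholder without touching the other two stages.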