This document summarizes a research paper that proposes a video indexing and retrieval method using shot boundary detection and audio track detection. It first extracts keypoints from divided frames to create a new frame sequence. Support vector machines are then used to match keypoints between frames to detect different types of shot transitions. Audio energy is also analyzed to detect sound tracks. The method aims to reduce computational costs by removing non-boundary frames and representing transition frames as thumbnails. It was tested on CCTV and film videos.