The document presents Facenet, a system for face recognition and clustering that learns a compact embedding from face images, directly linking distances to face similarity. It employs a deep convolutional network trained with a novel triplet loss and a unique online triplet mining method, achieving state-of-the-art performance on face recognition tasks with minimal data requirements. The system demonstrates significant efficiency and accuracy improvements over previous methods, setting new records on popular datasets.