This document summarizes an approach for identifying similar segments in social speech using machine learning segmentation techniques. It discusses: 1) Creating queries from human transcripts and indexing recordings using an IR platform after preprocessing. 2) Segmenting recordings regularly into overlapping passages or using machine learning classification trees trained on human transcripts to identify segment boundaries. 3) Features and models used for the machine learning segmentation of beginnings and ends of segments. 4) Evaluation results showing regular segmentation on ASR transcripts achieved the overall best performance.