This document describes how to build a simple automatic speaker recognition system. It discusses the principles of speaker recognition, which can be identification (determining which registered speaker is speaking) or verification (accepting or rejecting a speaker's claimed identity). The key components are feature extraction and feature matching. Feature extraction converts the speech waveform into features using techniques like MFCC. Feature matching then compares the extracted features to stored reference models to identify the speaker. The document focuses on the speech feature extraction process, which involves framing the speech signal, windowing frames, taking the FFT, and calculating MFCCs to characterize the signal in a way that mimics human hearing.