The ever-increasing impact of neurocognitive diseases is more and more apparent, as statistics show that due to the longer life expectancy established today, minor (Mild Cognitive Impairment-MCI) and major cognitive diseases (Dementia) will soon be a societal problem that cannot be ignored. Most of the currently established methods of neurodegeneration diagnosis are either invasive (blood tests, neuroimaging) and/or require a full neuropsychological and clinical assessment, which is performed in a clinical environment and usually requires a lot of time. To make the diagnosis process simpler, studies exist that focus on the speech decline which usually accompanies the cognitive one, so as to classify people according to their cognitive status, often by collecting speech data from structured interviews and deploying a machine learning model. In this study, the validity of a multiclass classification process is examined, aiming to robustly differentiate between earlier stages of the clinical spectrum of aging. Τhe target classes of this study comprise Healthy controls, Subjective Cognitive Decline (SCD), Early-MCI (E-MCI), Late-MCI (L-MCI). To collect data, 84 persons, aged 50 to 85, were recorded at the Greek Association of Alzheimer’s Disease and Related Disorders (GAADRD) center “Agia Eleni”, collecting a total of 1621 recordings along with their personal information. The recording process consisted of 5 different stages having the format of an informal interview with questions and dual-task prompts, so as to steadily increase the required cognitive effort, aiming at examining the performance differences across the stages. Three different types of audio features were extracted: silence features, prosodic features, and zero-crossings features. To quantify the changes in the participants’ speech between stages, a new feature vector was formed by subtracting the individual feature vectors between stages. The features per stage as well as the new features were evaluated with three classifiers, namely Random Forest, Extra-Trees and Support Vector Machines. Three sets of experiments were conducted according to the split of data in test and train data. First two sets consist of experiments in a 4-classes-classification as described, with random split of instances and split of instances per person accordingly, while the 3rd set consists of binary classifiers for further examination of the models’ distinctive ability. Different experiments were conducted, where models created by utilizing stage differences, features per stage, or even used in an ensemble majority voting system. (...)