This document discusses an unsupervised learning approach to identify sub-genres in music scores. It explores different ways of representing musical features like pitch and timing in vector formats that can be analyzed using clustering algorithms. Evaluating different feature representations on a sample of folk tunes, the best results were obtained using a combined weighting of pitch, timing, beats extracted from audio files. This approach shows potential for applications like music information retrieval, studying musical genres and connections between tunes.