The document presents a conference paper on enhancing multimedia data indexing through the innovative concept of composite visual words (cvw), which significantly reduces the number of required visual words while maintaining high retrieval performance. This approach was validated using standard datasets (Oxford and Paris buildings), showcasing that a much smaller vocabulary can achieve similar accuracy levels compared to methods using larger vocabularies. Future work is suggested to further test this methodology on larger datasets and explore alternative filtering strategies.