Embed presentation
Download to read offline

















This document discusses text mining and fuzzy document classification using Elasticsearch. It describes building document class vectors for job roles like "Java programmer" and ".NET programmer" that include skills and technologies weighted by importance. The document demonstrates classifying an unknown document by tokenizing it, comparing the tokens to classification vectors, and calculating cosine similarity distances to classify the document.















