TERMite is a semantic indexing engine that analyzes raw text at up to 1 million words per second, extracting structured data that can support new discoveries. It also manages the ambiguity common in scientific names, where the same term or abbreviation can refer to more than one entity. TERMite is backed by a collection of more than 80 vocabularies spanning the life sciences, which together contain over 20 million synonyms and are enriched through automated analysis and manual curation. Using these vocabularies, TERMite can distinguish relevant terms from irrelevant mentions, identify patterns of entities such as gene-disease relationships, and improve semantic search and discovery.
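
To make the idea of vocabulary-driven tagging concrete, here is a minimal Python sketch of synonym lookup and sentence-level gene-disease co-occurrence. It is not TERMite's API or implementation: the `VOCAB` entries, the entity identifiers, and the function names `build_matcher`, `tag`, and `gene_disease_pairs` are all hypothetical, and a production engine would rely on far larger vocabularies and richer disambiguation than the longest-match rule assumed here.

```python
import re
from itertools import product

# Hypothetical toy vocabulary: canonical ID -> (entity type, synonyms).
VOCAB = {
    "GENE:BRCA1": ("gene", ["BRCA1", "breast cancer 1"]),
    "GENE:TP53": ("gene", ["TP53", "p53"]),
    "DISEASE:BREAST_CANCER": ("disease", ["breast cancer", "breast carcinoma"]),
}


def build_matcher(vocab):
    """Compile one case-insensitive regex over every synonym."""
    lookup = {}
    for entity_id, (entity_type, synonyms) in vocab.items():
        for synonym in synonyms:
            lookup[synonym.lower()] = (entity_id, entity_type)
    # Longest synonyms first, so "breast cancer 1" (a gene name)
    # is preferred over the shorter "breast cancer" (a disease).
    alternatives = sorted(lookup, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(s) for s in alternatives) + r")\b",
        re.IGNORECASE,
    )
    return pattern, lookup


def tag(text, pattern, lookup):
    """Return each vocabulary hit with its canonical ID and type."""
    hits = []
    for match in pattern.finditer(text):
        entity_id, entity_type = lookup[match.group(0).lower()]
        hits.append({
            "id": entity_id,
            "type": entity_type,
            "text": match.group(0),
            "start": match.start(),
        })
    return hits


def gene_disease_pairs(text, pattern, lookup):
    """Collect (gene, disease) ID pairs that co-occur in one sentence."""
    pairs = set()
    # Naive sentence split on terminal punctuation (an assumption of this sketch).
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        hits = tag(sentence, pattern, lookup)
        genes = {h["id"] for h in hits if h["type"] == "gene"}
        diseases = {h["id"] for h in hits if h["type"] == "disease"}
        pairs.update(product(genes, diseases))
    return pairs


if __name__ == "__main__":
    pattern, lookup = build_matcher(VOCAB)
    sample = (
        "Mutations in p53 are common in breast carcinoma. "
        "Breast cancer 1 is a tumour suppressor."
    )
    print(gene_disease_pairs(sample, pattern, lookup))
```

In this sketch, mapping every synonym to a canonical identifier is what lets differently worded mentions be indexed as the same entity, and the longest-match ordering is one simple way to keep a gene name from being misread as a disease mention.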