Tagger is a dictionary-based named entity recognition (NER) tool that can process more than 1,000 abstracts per second. It achieves 70-80% recall and 80-90% precision by combining comprehensive dictionaries, expansion rules, and a curated blacklist, and it identifies entity types such as genes, proteins, chemicals, and diseases. The engine is written in C++ and is inherently thread-safe; the tool also includes interactive annotation, Python wrappers, and a REST API.
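To make the dictionary-based approach concrete, here is a minimal Python sketch of how dictionary lookup, an expansion rule, and a blacklist can work together. It is not Tagger's actual C++ engine or its Python wrapper: the dictionary entries, the identifiers, the single plural-stripping rule, and the function names `variants` and `tag` are all assumptions made for illustration.

```python
import re

# Hypothetical toy dictionary: lowercased name -> (entity type, identifier).
# The identifiers are made up for this sketch.
DICTIONARY = {
    "tp53": ("gene", "GENE:1"),
    "tumor protein p53": ("gene", "GENE:1"),
    "aspirin": ("chemical", "CHEM:1"),
    "breast cancer": ("disease", "DIS:1"),
    "was": ("gene", "GENE:2"),  # a gene symbol that collides with a common word
}

# Hypothetical curated blacklist: names that are too ambiguous to tag.
BLACKLIST = {"was"}

MAX_TOKENS = 3  # longest dictionary name, measured in tokens
TOKEN_RE = re.compile(r"[A-Za-z0-9]+")


def variants(name):
    """Yield the name itself, then the results of simple expansion rules.

    Only one illustrative rule is used here: strip a trailing plural "s".
    """
    yield name
    if len(name) > 3 and name.endswith("s"):
        yield name[:-1]


def tag(text):
    """Return (start, end, matched_text, entity_type, identifier) tuples.

    Scans left to right and prefers the longest match at each position.
    """
    tokens = [(m.start(), m.end(), m.group().lower())
              for m in TOKEN_RE.finditer(text)]
    matches = []
    i = 0
    while i < len(tokens):
        hit = None
        # Try the longest candidate span first, then shorter ones.
        for n in range(min(MAX_TOKENS, len(tokens) - i), 0, -1):
            candidate = " ".join(tok[2] for tok in tokens[i:i + n])
            for variant in variants(candidate):
                if variant in BLACKLIST:
                    continue  # blacklisted names are never reported
                entry = DICTIONARY.get(variant)
                if entry is not None:
                    hit = (n, entry)
                    break
            if hit is not None:
                break
        if hit is None:
            i += 1
            continue
        n, (entity_type, identifier) = hit
        start, end = tokens[i][0], tokens[i + n - 1][1]
        matches.append((start, end, text[start:end], entity_type, identifier))
        i += n  # skip past the matched tokens so matches do not overlap
    return matches


if __name__ == "__main__":
    for match in tag("Aspirin was tested in breast cancers with TP53 mutations."):
        print(match)
```

In this sketch the blacklist suppresses the ambiguous word "was" even though it appears in the dictionary, and the plural rule lets "breast cancers" resolve to the entry for "breast cancer". Because the dictionary and blacklist are only read during tagging, several texts could in principle be tagged concurrently without locking, which is one plausible reading of why a dictionary-based engine can be thread-safe.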


































