This document discusses building a finite state transducer (FST) for efficient dictionary lookups during tokenization. It describes building the FST by iterating through a word list, freezing states when word suffixes differ, and merging equivalent states. The built FST is then compiled into a program that can be executed by a virtual machine to lookup words. The program represents the FST as a list of instructions including transition characters and output values. By running the program backwards, it simulates traversing the FST from a word to an output.