This document presents the design of a 64 point fast Fourier transform (FFT) processor called Turbo64. It uses a radix-2 decimation-in-time algorithm to decompose a 64 point FFT into two 8 point FFTs. The architecture includes input, multiplier, constant buffer, and output units. The multiplier unit utilizes constant multiplication through hard-wired representations of constants. Fabricated in 0.25um BiCMOS, Turbo64 has a core area of 6.8 sq mm and power consumption of 41 mW, making it suitable for applications like OFDM used in wireless communications standards.