This document describes the design of a 32-bit parallel multiplier using VHDL. It compares the use of carry save adders (CSA) versus carry lookahead adders (CLA) in the partial product lines. CSA is used to add each group of partial products in parallel. Simulation waveforms are shown for half adders, full adders, and 4, 8, and 32-bit multipliers. Output is also shown on an FPGA for half and full adders. The conclusion is that using CSA in the partial product lines improves performance over other adders in terms of speed and efficiency for large multiplications.