This paper presents a hardware-efficient discrete wavelet transform (DWT) architecture using a modified lifting scheme and Wallace tree multiplier for low power and high-speed operations, implemented on an FPGA device. The design achieved operational frequencies of 520.738 MHz with power consumption below 0.103W, making it suitable for real-time data processing applications. Various architectural solutions for DWT are compared, highlighting the proposed architecture's advantages in performance and reduced computational complexity.