This document proposes and evaluates several approaches to real-time 3D wavelet lifting. It finds that a true 3D approach, which processes all three dimensions within a single loop, outperforms naive approaches that process each dimension in a separate sequential pass. Compared with the best naive approach, the true 3D approach achieves an 11.7x speedup on Intel Core 2 and an 8x speedup on AMD Opteron CPUs, because it makes better use of the CPU cache and of SIMD instructions.
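
To make the contrast between the two loop structures concrete, the following is a minimal sketch. It is not the implementation evaluated here. It assumes the Haar wavelet (the simplest lifting scheme, chosen only because its two-sample support lets a 2x2x2 block be lifted independently), a volume stored as a flat list with even dimensions, and hypothetical helper names (`haar_lift`, `lift_naive`, `lift_fused`). It shows only how the passes are arranged and does not attempt to reproduce the cache or SIMD behaviour behind the reported speedups.

```python
def haar_lift(a, b):
    """One Haar lifting pair (assumed wavelet): predict, then update."""
    d = b - a        # predict step: detail coefficient
    s = a + d / 2    # update step: average, equal to (a + b) / 2
    return s, d


def lift_axis(v, dims, axis):
    """Naive pass: lift every neighbouring pair along one axis, in place."""
    nx, ny, nz = dims
    step = (1, nx, nx * ny)[axis]   # distance between neighbours on this axis
    for z in range(nz):
        for y in range(ny):
            for x in range(nx):
                if (x, y, z)[axis] % 2 == 0:
                    i = (z * ny + y) * nx + x
                    v[i], v[i + step] = haar_lift(v[i], v[i + step])


def lift_naive(v, dims):
    """Three full sweeps over the volume, one per dimension."""
    for axis in range(3):
        lift_axis(v, dims, axis)


def lift_fused(v, dims):
    """Single sweep: each 2x2x2 block is lifted along x, y and z together."""
    nx, ny, nz = dims
    sx, sy, sz = 1, nx, nx * ny
    for z in range(0, nz, 2):
        for y in range(0, ny, 2):
            for x in range(0, nx, 2):
                base = (z * ny + y) * nx + x
                # Block sample k: bit 0 selects x, bit 1 selects y, bit 2 selects z.
                idx = [base + (k & 1) * sx + ((k >> 1) & 1) * sy + (k >> 2) * sz
                       for k in range(8)]
                blk = [v[i] for i in idx]
                for bit in (1, 2, 4):            # lift along x, then y, then z
                    for k in range(8):
                        if not (k & bit):
                            blk[k], blk[k | bit] = haar_lift(blk[k], blk[k | bit])
                for i, val in zip(idx, blk):
                    v[i] = val
```

Both functions apply the same arithmetic to each sample in the same order, so they should produce identical coefficients. The difference is that `lift_naive` reads and writes the whole volume three times, whereas `lift_fused` touches each sample once while its block is still in fast storage. Wavelets with longer filters need overlapping neighbourhoods, which this sketch does not handle.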