Denoising Diffusion Probabilistic Model Contrastive models like CLIP as a key inspiration. Demonstrates robust image representations capturing both semantics and style. Project Objectives: Two-stage model proposed: Prior generating a CLIP image embedding from a given text. Decoder generating an image based on these CLIP image embeddings.