This document summarizes a research paper on Fashion AI. It proposes a new Group Decreasing Network (GroupDNet) that uses group convolutions in the generator and gradually reduces the percentage of groups in the decoder's convolutions. This allows the model to have more control over generating images from semantic labels and produce high-quality, multi-modal outputs. The paper describes GroupDNet's architecture, compares it to other approaches like using multiple generators, and shows it outperforms other methods on challenging datasets based on metrics like FID and mIoU. Potential applications discussed include mixed fashion styles, semantic manipulation, and tracking fashion trends over time. The conclusion discusses GroupDNet's performance but notes room for improving computational efficiency