The document discusses the proposal of 'pro-VLAE', a progressive learning approach to enhance the learning and disentangling of hierarchical representations in variational autoencoders (VAEs). It demonstrates improved disentanglement metrics and superior performance compared to existing models like B-VAE and VLAE by progressively growing the network's capacity. Future work aims to strengthen guidance for information allocation across different abstraction levels.