This document discusses a method for age estimation using deep convolutional neural networks (CNNs) with distribution-based loss functions. Two CNN models with different architectures are trained separately on different datasets and then fine-tuned on a competition dataset. Distribution-based loss functions like KL divergence are used to exploit the uncertainty in age labels from multiple annotators. The results show this method achieves better age estimation performance than human-level benchmarks, with a mean absolute error of 0.3057 years.