The document discusses the Matbench testing protocol for evaluating machine learning models for materials property prediction. It summarizes the 13 different machine learning tasks in Matbench and the various models that have been tested, including Magpie, Automatminer, MODNet, CGCNN, ALIGNN, and CRABNet. The document outlines ways Matbench could be further improved, such as including a greater diversity of tasks, changing the data splitting methodology, and incorporating active learning into the scoring. The overall goal of Matbench is to provide a standard way to evaluate new machine learning algorithms for materials property prediction and measure progress in the field.