This document presents a new protocol for evaluating recommender systems, with a focus on industrial applications. The protocol is built around key functions of a recommender, such as helping users explore, decide, compare, and discover items. It discusses how to measure the effectiveness of each of these functions and highlights the importance of accounting for performance variations across user segments. The findings suggest that a traditional metric such as RMSE (root mean squared error) is not sufficient on its own to evaluate the overall quality of a recommender system.
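As a minimal illustration of why a single aggregate RMSE can hide variation across user segments, the sketch below computes RMSE overall and per segment. It is not the protocol described in this document: the segment names (`heavy_user`, `light_user`), the helper functions, and the ratings are all hypothetical and invented for this example.

```python
import math
from collections import defaultdict


def rmse(pairs):
    """Root mean squared error over (predicted, actual) rating pairs."""
    pairs = list(pairs)
    if not pairs:
        raise ValueError("rmse needs at least one (predicted, actual) pair")
    return math.sqrt(sum((p - a) ** 2 for p, a in pairs) / len(pairs))


def rmse_by_segment(records):
    """Group (segment, predicted, actual) records by segment and score each group."""
    groups = defaultdict(list)
    for segment, predicted, actual in records:
        groups[segment].append((predicted, actual))
    return {segment: rmse(pairs) for segment, pairs in groups.items()}


# Toy data, invented for illustration only (segment labels are assumptions).
records = [
    ("heavy_user", 4.0, 4.0),
    ("heavy_user", 3.0, 3.0),
    ("heavy_user", 5.0, 5.0),
    ("light_user", 4.0, 2.0),
]

overall = rmse((p, a) for _, p, a in records)
per_segment = rmse_by_segment(records)

print(f"overall RMSE: {overall:.2f}")
for segment, score in sorted(per_segment.items()):
    print(f"{segment}: {score:.2f}")
```

In this toy data the aggregate score sits between a segment that is predicted perfectly and one that is predicted poorly, which is the kind of gap a single global number would not reveal.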