Statisticians call for rigor and transparency in the evaluat