Rethink reporting of evaluation results in AI Aggregate metrics and lack of access to results limit understanding
成果类型:
Editorial Material
署名作者:
Burnell, Ryan; Schellaert, Wout; Burden, John; Ullman, Tomer D.; Martinez-Plumed, Fernando; Tenenbaum, Joshua B.; Rutar, Danaja; Cheke, Lucy G.; Sohl-Dickstein, Jascha; Mitchell, Melanie; Kiela, Douwe; Shanahan, Murray; Voorhees, Ellen M.; Cohn, Anthony G.; Leibo, Joel Z.; Hernandez-Orallo, Jose
署名单位:
University of Cambridge; Universitat Politecnica de Valencia; University of Cambridge; Harvard University; Massachusetts Institute of Technology (MIT); University of Cambridge; Alphabet Inc.; Google Incorporated; Stanford University; Alphabet Inc.; DeepMind; Imperial College London; Imperial College London; University of Leeds; Alan Turing Institute; Tongji University; Shandong University
刊物名称:
SCIENCE
ISSN/ISSBN:
0036-10073
DOI:
10.1126/science.adf6369
发表日期:
2023-04-14
页码:
136-138
关键词:
reproducibility