Is Citing a Single Benchmark Score Holding Your Team Back?
Engineers, product managers, and researchers often treat a single benchmark number as if it were definitive proof that one model or system is superior.