You don't need all the LLM benchmarks
entityYou don't need all the LLM benchmarksWith 57 subjects analyzed, this startup questions the heavy reliance on LLM benchmarks. Critics assert that 'the columns are wildly correlated,' raising significant doubts about long-standing evaluation practices.