The Best Measure of Progress Is Working Code

The way we measure progress in AI is terrible

Many of the most popular benchmarks for AI models are outdated or poorly designed. Every time a new AI model is released, it’s typically touted as acing its performance against a series of benchmarks.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Feedback

The way we measure progress in AI is terrible

Trending now