Technology ❯Artificial Intelligence ❯Large Language Models
FrontierMath
Epoch AI's new benchmark challenges AI models with problems requiring advanced reasoning, exposing significant limitations.