Particle
.news
Technology
❯
Artificial Intelligence
❯
Large Language Models
Evaluation Metrics
FrontierMath
3 ARTICLES
5mo ago
FrontierMath Benchmark Reveals AI's Struggles with Complex Math