Humanity's Last Exam is a language model benchmark consisting of 2,500 questions across a broad range of subjects. It was created jointly by the Center for AI Safety and Scale AI. From Wikipedia
The advanced AI model, praised for its reasoning and coding capabilities, is now available with limitations for free-tier users while paid users retain enhanced features.