HealthBench Dataset
The newly released dataset evaluates AI models' medical response accuracy, revealing top performers and raising concerns over grading transparency and safety validation.