OpenAI Unveils HealthBench: An Advanced Medical Large Model Test Set, Demonstrating Remarkable Performance Enhancements


2025-05-13

Author: Editor

OpenAI has introduced HealthBench, an open-source evaluation set for testing large language models in medical settings. Built with input from 262 physicians worldwide, HealthBench comprises 48,562 scoring criteria and uses multi-turn conversations to approximate real-world medical scenarios. Results on the benchmark point to marked gains in newer models: GPT-4.1 nano outperforms GPT-4o while costing roughly 25 times less.
