Google's new medical AI scores 86.5% on medical exam. Human doctors preferred its outputs over actual doctor answers.

A new study from Google’s research division shows that Med-PaLM 2, their AI language model specifically trained in medical knowledge, scored an astounding 86.5% on a question set styled after the US Medical Licensing Examination (USMLE), well surpassing the typical 60% pass threshold for human examinees. More importantly, a panel of human doctors consistently preferred Med-PaLM 2's answers to those offered by actual physicians, a sign of the massive leaps in progress AI models have made in mere months.

"Answering medical questions by applying medical knowledge and reasoning at a level comparable to doctors has long been seen as a significant challenge," the researchers observed. Their study’s findings represent substantial progress towards this ambitious goal.


Robot doctors are on the way
 

Trending

Top