Opinion: Three ways to test medical AI for safety
The biggest challenge of using generative AI in medicine is the hallucination problem, in which the system just makes things up.
This article is adapted from “The AI Revolution in Medicine: GPT-4 and Beyond,” by Peter Lee, Carey Goldberg, and Isaac Kohane, published by Pearson.
“Thrashing.” That’s what old-school computer scientists called it when an operating system is running so many tasks at once that just switching among them basically crashes it. And that’s how I felt last fall when I tested GPT-4, the far more powerful successor to ChatGPT, on medical challenges for the first time. I was caught in a stuttering stasis between two competing, nearly overwhelming realizations.