
Researchers say an AI-powered transcription tool used in hospitals is making up things no one has ever said


SAN FRANCISCO (AP) — Tech giant OpenAI has touted its artificial intelligence-powered transcription tool Whisper as having human-level robustness and accuracy.

But Whisper has a major drawback: It has a tendency to make up chunks of text or even entire sentences, according to interviews with more than a dozen software engineers, developers and academic researchers. Those experts said some of the made-up text — known in the industry as hallucinations — could include racist commentary, violent rhetoric and even imagined medical treatments.

Experts said such fabrications are problematic because Whisper is used in a range of industries around the world to translate and transcribe interviews, generate text in popular consumer technologies and create subtitles for videos.
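The article does not describe how any particular product integrates Whisper, but for readers unfamiliar with the tool, a minimal sketch of how developers commonly run the open-source model follows. It assumes the `openai-whisper` Python package and ffmpeg are installed; the file name is hypothetical.

# Minimal sketch of transcribing audio with the open-source Whisper model.
# Assumes `pip install openai-whisper` and ffmpeg; "interview.mp3" is a
# hypothetical local recording used only for illustration.
import whisper

# Load one of the published model sizes; "base" trades accuracy for speed.
model = whisper.load_model("base")

# Transcribe the recording; Whisper returns the full text plus per-segment
# timings, which subtitle and transcript tools rely on downstream.
result = model.transcribe("interview.mp3")
print(result["text"])

for segment in result["segments"]:
    print(f"[{segment['start']:.1f}s - {segment['end']:.1f}s] {segment['text']}")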

More worrying, they said, is the rush by medical centers to use Whisper-based tools to transcribe patients’ consultations with doctors, despite OpenAI’s warnings that the tool should not be used in “high-risk domains.”

The full extent of the problem is difficult to know, but researchers and engineers said they often encountered Whisper’s hallucinations in their work. For example, a University of Michigan researcher conducting a study of public meetings said he found hallucinations in eight of 10 audio transcripts he checked before he began trying to improve the model.

A machine learning engineer said he initially discovered hallucinations in about half of the more than 100 hours of Whisper transcripts he analyzed. A third developer said he found hallucinations in almost every one of the 26,000 transcriptions he made with Whisper.

The problems persist even with well-recorded, short audio clips. A recent study by computer scientists revealed 187 hallucinations in the more than 13,000 clear audio samples they examined.

That trend would lead to tens of thousands of incorrect transcriptions across millions of recordings, researchers said.

Such mistakes could have “very serious consequences,” especially in hospitals, said Alondra Nelson, who led the White House Office of Science and Technology Policy for the Biden administration until last year.

“No one wants a misdiagnosis,” said Nelson, a professor at the Institute for Advanced Study in Princeton, New Jersey. “There should be a higher bar.”

Whisper is also used to create captions for the deaf and hard of hearing – a population at particular risk from incorrect transcriptions. That’s because deaf and hard of hearing people have no way of identifying fabrications “hidden among all these other texts,” said Christian Vogler, who is deaf and directs Gallaudet University’s Technology Access Program.


