Hospitals adopt error-prone AI transcription tools despite warnings

You May Be Interested In:Errant reference in macOS 15.2 seems to confirm M4 MacBook Airs for 2025



In one case from the study cited by AP, when a speaker described “two other girls and one lady,” Whisper added fictional text specifying that they “were Black.” In another, the audio said, “He, the boy, was going to, I’m not sure exactly, take the umbrella.” Whisper transcribed it to, “He took a big piece of a cross, a teeny, small piece … I’m sure he didn’t have a terror knife so he killed a number of people.”

An OpenAI spokesperson told the AP that the company appreciates the researchers’ findings and that it actively studies how to reduce fabrications and incorporates feedback in updates to the model.

Why Whisper confabulates

The key to Whisper’s unsuitability in high-risk domains comes from its propensity to sometimes confabulate, or plausibly make up, inaccurate outputs. The AP report says, “Researchers aren’t certain why Whisper and similar tools hallucinate,” but that isn’t true. We know exactly why Transformer-based AI models like Whisper behave this way.

Whisper is based on technology that is designed to predict the next most likely token (chunk of data) that should appear after a sequence of tokens provided by a user. In the case of ChatGPT, the input tokens come in the form of a text prompt. In the case of Whisper, the input is tokenized audio data.

The transcription output from Whisper is a prediction of what is most likely, not what is most accurate. Accuracy in Transformer-based outputs is typically proportional to the presence of relevant accurate data in the training dataset, but it is never guaranteed. If there is ever a case where there isn’t enough contextual information in its neural network for Whisper to make an accurate prediction about how to transcribe a particular segment of audio, the model will fall back on what it “knows” about the relationships between sounds and words it has learned from its training data.

share Paylaş facebook pinterest whatsapp x print

Similar Content

OpenGarage unit on the roof, with a garage door open below it (to the right in frame)
I, too, installed an open source garage door opener, and am loving it
A view of the Bell Tower on the campus of North Carolina State University in Raleigh, North Carolina.
HowStuffWorks founder Marshall Brain sent final email before sudden death
In this photo illustration the American social news
Reddit’s getting more popular—and profitable
SAN FRANCISCO - SEPTEMBER 20: Freshly printed copies of the San Francisco Chronicle run through the printing press at one of the Chronicle
Nexus review: Yuval Noah Harari is out of his depth in his new book
Tiny nuclear-powered battery could work for decades in space or at sea
Tiny nuclear-powered battery could work for decades in space or at sea
DNA computer can play chess and solve sudoku puzzles
DNA computer can play chess and solve sudoku puzzles
The News Spectrum | © 2024 | News