Are bad incentives to blame for AI hallucinations?

https://techcrunch.com/feed/ Hits: 21

Summary

A new research paper from OpenAI asks why large language models like GPT-5 and chatbots like ChatGPT still hallucinate, and whether anything can be done to reduce those hallucinations. In a blog post summarizing the paper, OpenAI defines hallucinations as “plausible but false statements generated by language models,” and it acknowledges that despite improvements, hallucinations “remain a fundamental challenge for all large language models” — one that will never be completely eliminated. To illustrate the point, researchers say that when they asked “a widely used chatbot” about the title of Adam Tauman Kalai’s Ph.D. dissertation, they got three different answers, all of them wrong. (Kalai is one of the paper’s authors.) They then asked about his birthday and received three different dates. Once again, all of them were wrong. How can a chatbot be so wrong — and sound so confident in its wrongness? The researchers suggest that hallucinations arise, in part, because of a pretraining process that focuses on getting models to correctly predict the next word, without true or false labels attached to the training statements: “The model sees only positive examples of fluent language and must approximate the overall distribution.” “Spelling and parentheses follow consistent patterns, so errors there disappear with scale,” they write. “But arbitrary low-frequency facts, like a pet’s birthday, cannot be predicted from patterns alone and hence lead to hallucinations.” The paper’s proposed solution, however, focuses less on the initial pretraining process and more on how large language models are evaluated. It argues that the current evaluation models don’t cause hallucinations themselves, but they “set the wrong incentives.” The researchers compare these evaluations to the kind of multiple choice tests random guessing makes sense, because “you might get lucky and be right,” while leaving the answer blank “guarantees a zero.” Techcrunch event San Francisco | October 27-29, 2025 “...

First seen: 2025-09-07 21:41

Last seen: 2025-09-08 17:47

Read Full Article More from this Source

Are bad incentives to blame for AI hallucinations?

Summary

Related News

Uber will add Blade’s helicopters to its platform as early as 2026

This app just raised $14M to take on the loneliness epidemic

AI gaming startup Born raises $15M to build ‘social’ AI companions that combat loneliness

Robinhood embraces copy trading after warning competitors about regulatory risks

Sources: AI training startup Mercor eyes $10B+ valuation on $450 million run rate