Why AI guesses instead of admitting uncertainty

Introduction

Modern chatbots often generate fluent answers even when the evidence behind a claim is weak or missing. A key reason is that many language models are trained, evaluated, and rewarded in ways that make answering more valuable than admitting uncertainty. In practice, a chatbot may gain more from producing a plausible response than from saying “I don’t know”, even when uncertainty is high. Researchers increasingly argue that this behaviour is not simply a bug but a predictable consequence of how large language models are built, tested, and compared. [OpenAI]

Understanding this mechanism helps explain why confident-sounding mistakes occur. The issue is not that chatbots deliberately deceive users. Rather, they are often optimised to maximise successful answers, and that optimisation can push them towards guessing when information is incomplete. [arXiv]

How answer-first incentives shape chatbot behaviour

Large language models are trained to generate likely continuations of text. During pretraining, the system learns statistical patterns from enormous datasets. The model’s basic task is not to determine whether a statement is true but to predict what text is likely to come next. This means the model is naturally geared towards producing an answer rather than withholding one. [Arize AI]

The pressure to answer becomes stronger during later training stages. Developers typically want chatbots that are helpful, responsive, and capable of addressing a wide range of questions. Users often prefer a model that attempts an answer over one that repeatedly refuses. As a result, training and tuning processes frequently reward useful-looking responses and penalise excessive refusals. [OpenAI]

A useful analogy is an exam. Imagine a student facing a difficult multiple-choice question. If there is no penalty for guessing, selecting an answer can produce a better score than leaving the question blank. Researchers have argued that many AI evaluation systems create a similar incentive structure. Under those conditions, guessing becomes a rational strategy from the model’s perspective. [arXiv; OpenAI CDN]

Why abstaining can score worse than guessing

The strongest recent explanation for chatbot guessing focuses on evaluation metrics. Many benchmarks judge systems solely by how many questions they answer correctly. Under that kind of grading a wrong answer earns nothing, but an abstention typically earns nothing either. Because some guesses will inevitably be correct, a model that answers aggressively can outperform a model that abstains frequently. [arXiv]
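The arithmetic behind this incentive can be sketched in a few lines of Python. This is an illustrative toy rather than code from any of the cited sources: it assumes a binary grading scheme in which a correct answer earns one point and a wrong answer or an abstention earns nothing, and the names expected_score and ABSTAIN_SCORE, along with the sample probabilities, are hypothetical.

```python
# Toy model of accuracy-only grading (assumed scheme, not a real benchmark).
ABSTAIN_SCORE = 0.0  # assumption: "I don't know" earns no credit


def expected_score(p_correct: float, wrong_penalty: float = 0.0) -> float:
    """Expected points for answering when the answer is right with probability p_correct.

    A correct answer earns 1 point; a wrong answer loses `wrong_penalty` points.
    With the default penalty of 0 this is plain accuracy-style grading.
    """
    return p_correct * 1.0 - (1.0 - p_correct) * wrong_penalty


# Compare guessing with abstaining at a few made-up confidence levels.
for p in (0.10, 0.25, 0.50):
    guess = expected_score(p)
    better = "guess" if guess > ABSTAIN_SCORE else "abstain"
    print(f"p_correct={p:.2f}  guess={guess:.2f}  abstain={ABSTAIN_SCORE:.2f}  -> {better}")
```

Under these assumptions, any non-zero chance of being right gives guessing a higher expected score than abstaining, which is the incentive the exam analogy describes.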

This creates a subtle but powerful feedback loop:

Models that answer more questions often achieve higher benchmark scores.
Higher benchmark scores influence research priorities, rankings, and model development.
Training methods evolve to improve those scores.
Behaviour that favours answering becomes reinforced.

The authors of the paper Why Language Models Hallucinate argue that this incentive structure helps explain why hallucinations persist even as models become more capable. In their view, many systems are effectively being trained to become excellent test-takers rather than excellent judges of when knowledge is missing. [arXiv]

This also explains why a chatbot may produce a detailed response to a question that lacks sufficient information. From the model’s learned perspective, generating a plausible answer can be statistically advantageous compared with refusing to answer. [IBM]

Why uncertainty is difficult for language models

Humans can often recognise when they lack knowledge and say so explicitly. Language models do not possess this capability in the same way. They generate text by estimating probabilities over possible continuations rather than consulting a separate internal database of verified facts. [Wall Street Journal]

As a result, uncertainty is not automatically translated into phrases such as “I don’t know” or “I am not sure”. The model may simply continue generating the most statistically plausible sequence of words available. When evidence is sparse, this process can produce answers that sound coherent while lacking reliable factual support. [Arize AI]
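A small sketch can show why uncertainty does not surface on its own. It assumes a heavily simplified picture of decoding: a handful of candidate continuations with made-up scores (logits), and greedy selection of the most probable one. The token strings, the scores, and the function names softmax and greedy_pick are all hypothetical; real models work over very large vocabularies and often sample rather than pick greedily.

```python
import math


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    top = max(logits)
    exps = [math.exp(x - top) for x in logits]  # subtract the max for numerical stability
    total = sum(exps)
    return [e / total for e in exps]


def greedy_pick(tokens, logits):
    """Return the most probable token, its probability, and the entropy of the distribution."""
    probs = softmax(logits)
    best = max(range(len(tokens)), key=lambda i: probs[i])
    entropy = -sum(p * math.log(p) for p in probs if p > 0)
    return tokens[best], probs[best], entropy


# Hypothetical candidate continuations for "The bridge opened in ...".
tokens = ["1879", "1881", "1885", "1890"]

confident = greedy_pick(tokens, [8.0, 1.0, 0.5, 0.2])  # one option clearly dominates
unsure = greedy_pick(tokens, [1.1, 1.0, 0.9, 0.8])     # nearly flat: close to a coin toss

print("confident:", confident)
print("unsure:   ", unsure)
```

Both calls emit the same token, and nothing in the emitted text distinguishes the near-certain case from the near-random one. The difference is visible only in the probability and entropy values, which a plain text response does not show.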

Another challenge is calibration. A model may have internal signals indicating uncertainty, but those signals do not always appear clearly in the final response. The output can therefore sound confident even when the underlying prediction process is uncertain. [arXiv]
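One common way to quantify calibration is expected calibration error: group answers by stated confidence and compare each group's average confidence with how often the group was actually right. The sketch below is a minimal version of that idea and is not taken from the cited sources; the function name, the choice of five bins, and the toy data are assumptions.

```python
def expected_calibration_error(confidences, correct, n_bins=5):
    """Weighted average gap between stated confidence and observed accuracy.

    confidences: probabilities in [0, 1] that each answer is right
    correct:     booleans saying whether each answer was right
    """
    if len(confidences) != len(correct) or not confidences:
        raise ValueError("need two non-empty sequences of equal length")

    # Place each answer in an equal-width confidence bin.
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        index = min(int(conf * n_bins), n_bins - 1)  # a confidence of 1.0 goes in the last bin
        bins[index].append((conf, ok))

    ece = 0.0
    for members in bins:
        if not members:
            continue
        avg_conf = sum(conf for conf, _ in members) / len(members)
        accuracy = sum(1 for _, ok in members if ok) / len(members)
        ece += len(members) / len(confidences) * abs(avg_conf - accuracy)
    return ece


# Made-up example: the system claims 90% confidence but is right only half the time.
overconfident = expected_calibration_error([0.9, 0.9, 0.9, 0.9], [True, True, False, False])
print(f"calibration gap: {overconfident:.2f}")
```

A value near zero would mean stated confidence tracks accuracy closely. A large value, as in this toy case, corresponds to output that sounds more certain than its track record justifies.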

What better uncertainty signals would look like

Researchers are increasingly exploring ways to reward restraint instead of guessing. One approach is abstention: allowing a model to decline answering when confidence falls below a threshold. Studies have found that selective abstention can significantly reduce hallucinations and improve reliability, particularly when questions are unanswerable or lack sufficient context. [arXiv]
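Threshold-based abstention can be sketched as follows. The example assumes each answer comes with a confidence estimate, which is itself a hard problem in practice, and it uses four made-up predictions chosen so the effect is easy to see. The Prediction class, the function names, and the threshold values are hypothetical and do not reproduce the method of any cited study.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Prediction:
    answer: str
    confidence: float  # assumed estimate of the probability that the answer is right
    correct: bool      # ground truth, known only in an evaluation setting


def answer_or_abstain(pred: Prediction, threshold: float) -> str:
    """Return the answer only when confidence reaches the threshold."""
    return pred.answer if pred.confidence >= threshold else "I don't know"


def summarise(preds: list[Prediction], threshold: float) -> tuple[float, float]:
    """Return (coverage, error rate among the questions that were answered)."""
    answered = [p for p in preds if p.confidence >= threshold]
    coverage = len(answered) / len(preds)
    errors = sum(1 for p in answered if not p.correct)
    error_rate = errors / len(answered) if answered else 0.0
    return coverage, error_rate


# Made-up predictions; real confidence estimates are rarely this clean.
PREDICTIONS = [
    Prediction("answer A", 0.95, True),
    Prediction("answer B", 0.80, True),
    Prediction("answer C", 0.40, False),
    Prediction("answer D", 0.30, False),
]

for threshold in (0.0, 0.6):
    coverage, error_rate = summarise(PREDICTIONS, threshold)
    print(f"threshold={threshold:.1f}  coverage={coverage:.2f}  error_rate={error_rate:.2f}")
```

With no threshold, every question is answered and half the answers are wrong. With a threshold of 0.6, the two low-confidence answers become “I don't know”, so fewer questions are answered but none of the given answers is wrong. That trade between coverage and error rate is the core of the abstention approach.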

Several improvements have been proposed:

Explicit uncertainty statements that indicate confidence levels rather than presenting all answers with the same tone.
Abstention mechanisms that allow the model to refuse when evidence is insufficient.
Evidence-linked answers that show where information came from instead of relying solely on generated text.
Evaluation systems that reward honesty, giving credit for correctly identifying uncertainty rather than encouraging speculative answers. [OpenAI; arXiv]
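One way to picture an evaluation that stops rewarding speculation is a scoring rule with an explicit confidence target. The sketch below assumes a target t, one point for a correct answer, zero points for “I don't know”, and a penalty of t / (1 - t) points for a wrong answer, a value chosen so that answering breaks even exactly when the chance of being right equals t. The function names and the sample numbers are hypothetical, and real benchmarks may score differently.

```python
def wrong_answer_penalty(target: float) -> float:
    """Penalty that makes answering break even exactly at `target` confidence."""
    if not 0.0 <= target < 1.0:
        raise ValueError("target must be in [0, 1)")
    return target / (1.0 - target)


def expected_score_if_answering(p_correct: float, target: float) -> float:
    """Expected points for answering: +1 if right, minus the penalty if wrong."""
    return p_correct - (1.0 - p_correct) * wrong_answer_penalty(target)


def best_action(p_correct: float, target: float) -> str:
    """Answer only when the expected score beats the 0 points earned by abstaining."""
    return "answer" if expected_score_if_answering(p_correct, target) > 0.0 else "abstain"


# With a target of 0.75, each wrong answer costs 3 points.
for p in (0.50, 0.75, 0.90):
    score = expected_score_if_answering(p, target=0.75)
    print(f"p_correct={p:.2f}  expected={score:+.2f}  -> {best_action(p, target=0.75)}")
```

Setting the target to zero removes the penalty and recovers accuracy-only grading, where guessing always pays. Raising the target makes abstention the better choice whenever confidence falls short of it, so a model optimised against such a rule has a reason to say when it does not know.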

Recent research efforts have also explored systems that encourage models to report possible mistakes or uncertainty after producing an answer. The goal is not merely to improve accuracy but to make uncertainty visible to users. [Tom's Guide]

The trade-off between usefulness and restraint

Although saying “I don’t know” sounds like an obvious solution, the issue is more complicated. A chatbot that refuses too often becomes less useful. A model that never takes informed risks may avoid some mistakes, but it also fails to help users when partial information could still be valuable. [Business Insider]

Developers therefore face a balancing problem. They want systems that answer confidently when evidence is strong, acknowledge uncertainty when evidence is weak, and avoid inventing information when evidence is absent. Achieving that balance requires changes not only to model training but also to the benchmarks and reward systems used to judge success. [arXiv; OpenAI]

The central lesson is that chatbot guessing is not primarily a personality trait or design quirk. It emerges from incentives. When answering is rewarded more than admitting uncertainty, models learn to answer. When uncertainty is recognised and rewarded appropriately, models become more willing to say what users sometimes most need to hear: “I don’t know.” [OpenAI; arXiv]

Endnotes

Source: OpenAI
Title: why language models hallucinate
Link: https://openai.com/index/why-language-models-hallucinate/
Source snippet
5 Sept 2025 —... language models hallucinate because standard training and evaluation procedures reward guessing over acknowledging unce...
Source: arxiv.org
Title: arXiv Why Language Models Hallucinate
Link: https://arxiv.org/abs/2509.04664
Source: arxiv.org
Link: https://arxiv.org/html/2509.04664v1
Source snippet
Why Language Models Hallucinate4 Sept 2025 — We argue that language models hallucinate because the training and evaluation procedures rew...
Source: arize.com
Title: openais santosh vempala explains why language models hallucinate
Link: https://arize.com/blog/openais-santosh-vempala-explains-why-language-models-hallucinate/
Source snippet
Arize AIOpenAI's Santosh Vempala Explains Why Language...Oct 24, 2025 — “Pre-training encourages hallucinations because language models...
Source: cdn.openai.com
Title: why language models hallucinate
Link: https://cdn.openai.com/pdf/d04913be-3f6f-4d2b-b283-ff432ef4aaa5/why-language-models-hallucinate.pdf
Source snippet
OpenAI CDNWhy Language Models Hallucinateby AT Kalai · 2025 · Cited by 356 — When uncertain, students may guess on multiple-choice exams...
Source: ibm.com
Link: https://www.ibm.com/think/news/hidden-incentives-driving-ai-hallucinations
Source snippet
The hidden incentives driving AI hallucinations. What happens when large language models are trained to provide wrong answers instead of...
Source: arxiv.org
Link: https://arxiv.org/abs/2404.10960
Source snippet
Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinations. Published: April 17, 2024
Source: arxiv.org
Link: https://arxiv.org/pdf/2509.04664
Source snippet
Like students facing hard exam questions, large language models sometimes guess when uncertain, producing plausible yet incorrect...Read...
Source: community.openai.com
Title: why language models hallucinate openai research paper
Link: https://community.openai.com/t/why-language-models-hallucinate-openai-research-paper/1356581
Source snippet
Models... language models hallucinate because standard training and evaluation procedures reward guessing over acknowledging uncertainty...
Source: wsj.com
Link: https://www.wsj.com/tech/ai/ai-halluciation-answers-i-dont-know-738bde07
Source snippet
These models generate responses using educated guesses without expressing uncertainty, often leading to errors known as hallucinations. T...
Source: businessinsider.com
Link: https://www.businessinsider.com/why-ai-chatbots-hallucinate-openai-chatgpt-anthropic-claude-2025-9
Source snippet
This test-centric optimization encourages models to provide confident but potentially incorrect outputs, rather than abstaining when unsu...
Source: euronews.com
Title: Why do AI models make things up or hallucinate?
Link: https://www.euronews.com/next/2025/09/09/why-do-ai-models-make-things-up-or-hallucinate-openai-says-it-has-the-answer-and-how-to-pr
Source snippet
OpenAI...9 Sept 2025 — The reason hallucinations continue is because LLMs are “optimised to be good test-takers and guessing when uncert...
Source: livescience.com
Title: Live Science AI hallucinates more frequently as it gets more advanced
Link: https://www.livescience.com/technology/artificial-intelligence/ai-hallucinates-more-frequently-as-it-gets-more-advanced-is-there-any-way-to-stop-it-from-happening-and-should-we-even-try
Source snippet
OpenAI's latest reasoning models, o3 and o4-mini, showed higher hallucination rates than earlier versions, sparking concerns about the re...
Source: tomsguide.com
Title: Tom’s Guide Open AI is teaching AI models to ‘confess’ when they hallucinate
Link: https://www.tomsguide.com/ai/chatgpt/openai-is-teaching-ai-models-to-confess-when-they-hallucinate-heres-what-that-actually-means
Source snippet
Rather than making models more self-aware or accurate, this method adds a secondary output—called a "ConfessionReport"—where the AI evalu...
Source: computerworld.com
Link: https://www.computerworld.com/article/4059383/openai-admits-ai-hallucinations-are-mathematically-inevitable-not-just-engineering-flaws.html
Source snippet
OpenAI admits AI hallucinations are mathematically...18 Sept 2025 — “We argue that language models hallucinate because the training and...
Source: reddit.com
Title: Why Language Models Hallucinate
Link: https://www.reddit.com/r/MachineLearning/comments/1namvsk/why_language_models_hallucinate_openai_pseudo/
Source snippet
OpenAi pseudo paperEffectively the problem is that “guessing, even if not confident” yields better results at benchmarks than saying “I d...
Source: facebook.com
Link: https://www.facebook.com/asthait/posts/is-your-ai-hallucinatinggood-news-openais-latest-research-suggests-theyve-finall/1186833133467666/
Source snippet
OpenAI's latest research suggests they...Current benchmarks reward AI for guessing wildly rather than admitting uncertainty, much like a...
Source: thealgorithmicbridge.com
Title: openai researchers have discovered
Link: https://www.thealgorithmicbridge.com/p/openai-researchers-have-discovered
Source snippet
language models hallucinate because standard training and evaluation procedures reward guessing over acknowledging uncertainty.Read more...
Source: galileo.ai
Title: why language models hallucinate
Link: https://galileo.ai/blog/why-language-models-hallucinate
Source snippet
Understanding Why Language Models Hallucinate?Sep 8, 2025 — A recent paper from OpenAI "Why Language Models Hallucinate" claims to prove...

Additional References

Source: news.ycombinator.com
Link: https://news.ycombinator.com/item?id=45147385
Source snippet
language models hallucinatelanguage models can be trained to abstain when uncertain, by changing how rewards are set up. Incentives curre...
Source: linkedin.com
Link: https://www.linkedin.com/posts/zainhas_new-paper-from-openai-why-llms-hallucinate-activity-7369831862451531776-dY1n
Source snippet
New paper from OpenAI: "Why LLMs Hallucinate"We argue that language models hallucinate because the training and evaluation procedures rew...
Source: reddit.com
Link: https://www.reddit.com/r/singularity/comments/1n9fued/new_research_from_openai_why_language_models/
Source snippet
New research from OpenAI: "Why language models...Claim: Hallucinations are inevitable. Finding: They are not, because language models ca...
Source: medium.com
Link: https://medium.com/%40markus_brinsa/why-ai-models-always-answer-even-when-they-shouldnt-e95081e3f46b
Source snippet
Why AI Models Always Answer — Even When They Shouldn'tModern AI chatbots and large language models (LLMs) almost never admit “I don't kno...
Source: lifewire.com
Link: https://www.lifewire.com/ai-chatbot-hallucinations-7643422
Source snippet
AI's hallucinations are not indicative of sentience but are due to the complexities in training and fine-tuning language models with dive...
Source: sawantvishwajeet729.medium.com
Link: https://sawantvishwajeet729.medium.com/understanding-why-language-models-hallucinate-a-deep-dive-into-openais-latest-research-a5ccea95a327
Source snippet
Why Language Models Hallucinate: A Deep...The paper demonstrates that hallucinations aren't mysterious artifacts — they emerge naturally...
Source: linkedin.com
Link: https://www.linkedin.com/posts/haythamassem_why-language-models-hallucinatepdf-activity-7370201125955997697–izi
Source: youtube.com
Link: http://www.youtube.com/watch?v=qhCZsYufxqk
Source snippet
"An explanation of the “illusion of thinking” paper re: LLMs. http://www.youtube.com/watch?v=T-SP_z_4-e0..."
Source: linkedin.com
Title: Why don’t AI models just say “I don’t know”?
Link: https://www.linkedin.com/posts/brahim-guaali_why-dont-ai-models-just-say-i-dont-know-activity-7371955112908455936-RkP2
Source snippet
Brahim GuaaliBecause current training and evaluation often reward guessing instead of admitting uncertainty. Accuracy-only benchmarks p...
Source: reddit.com
Link: https://www.reddit.com/r/ArtificialInteligence/comments/1qpd2qg/why_ai_chatbots_guess_instead_of_saying_i_dont/
Source snippet
arly doesn’t know the answer, it still gives you one instead of...
