Within Training data

When AI learns the wrong shortcut

Models can learn shortcuts that look useful in training data but collapse when the real-world setting changes.

On this page

  • How accidental patterns enter training data
  • Why shortcuts can pass ordinary tests
  • Examples of failures when settings change
Preview for When AI learns the wrong shortcut

Introduction

Artificial intelligence systems learn from patterns in data, not from an inherent understanding of the world. This creates a recurring problem: a model may discover a pattern that predicts the correct answer in its training data without learning the underlying reason the answer is correct. Researchers often call these misleading patterns spurious correlations or shortcuts. They can make an AI system look highly accurate during development while causing it to fail when conditions change. [arXiv]arxiv.orgarXiv Shortcut Learning in Deep Neural NetworksShortcut Learning in Deep Neural NetworksApril 16, 2020…Published: April 16, 2020

Shortcuts illustration 1 The danger is not that the model learns nothing useful. Rather, it learns a mixture of meaningful signals and accidental clues. As long as those clues remain present, performance may appear excellent. When the environment changes and the clues disappear, the model’s apparent intelligence can collapse surprisingly quickly. This is one of the main reasons AI systems that perform well in laboratories sometimes struggle in real-world deployment. [arXiv]arxiv.orgarXiv Shortcut Learning in Deep Neural NetworksShortcut Learning in Deep Neural NetworksApril 16, 2020…Published: April 16, 2020

How accidental patterns enter training data

Training datasets are created from real-world observations, and real-world data often contains hidden regularities. Some of those regularities reflect genuine relationships. Others are accidents of how the data was collected.

Imagine a dataset used to identify animals. If photographs of cows are usually taken in grassy fields, a model may discover that green textures are a strong predictor of the “cow” label. The grass is correlated with cows in the dataset, but grass is not what makes an animal a cow. A model that relies heavily on background scenery has learned a shortcut rather than the intended concept. [DeepLearning.ai]deeplearning.aiThe Causes of Shortcut Learning in Neural NetworksJun 3, 2020 — Dataset bias can cause models to focus on spurious correlations rather th…

The same mechanism appears across many domains:

  • Medical images may contain hospital-specific markings, scanner characteristics, or demographic patterns that happen to correlate with diagnoses.
  • Language models may learn that certain words frequently appear alongside particular labels, even when those words are not the true reason for the label.
  • Computer vision systems may rely on image backgrounds, lighting conditions, or camera artefacts instead of the objects they are supposed to recognise. [PMC+2IJCAI]pmc.ncbi.nlm.nih.govPMC“Shortcuts” Causing Bias in Radiology Artificial Intelligenceby I Banerjee · 2023 · Cited by 150 — In this review, the authors discuss…

A key insight from research on shortcut learning is that machine-learning systems are rewarded for finding whatever signal most efficiently reduces prediction errors. They are not automatically rewarded for finding the signal humans consider meaningful. [arXiv]arxiv.orgarXiv Shortcut Learning in Deep Neural NetworksShortcut Learning in Deep Neural NetworksApril 16, 2020…Published: April 16, 2020

Why shortcuts can pass ordinary tests

One reason spurious correlations are difficult to detect is that standard testing often uses data that resembles the training data.

If both training and testing datasets contain the same accidental pattern, a shortcut can continue producing correct answers. The model therefore receives a high score and appears reliable. Developers may conclude that the system has learned the intended concept when it has actually learned an easier proxy. [arXiv]arxiv.orgarXiv Shortcut Learning in Deep Neural NetworksShortcut Learning in Deep Neural NetworksApril 16, 2020…Published: April 16, 2020

Researchers describe this as a generalisation problem. The model has learned a rule that works within a particular data distribution but not outside it. A shortcut can therefore survive multiple rounds of evaluation because the evaluation environment contains the same hidden bias. [PMC]pmc.ncbi.nlm.nih.govPMCBeyond Distribution Shift: Spurious Features Throughby N Murali · 2023 · Cited by 18 — Deep Neural Networks (DNNs) are prone to learning spurious features that correlate with the label d…

This explains why accuracy numbers alone can be misleading. A model may achieve excellent benchmark performance while still relying on fragile signals. Studies of shortcut learning have shown that monitoring accuracy does not necessarily reveal whether a model has learned robust features or merely convenient correlations. [OpenReview]openreview.netShortcut Learning Through the Lens of Early Training…by N Murali · Cited by 3 — Summary Of The Paper: The paper studies the…

Why new settings expose the problem

The failure occurs when the environment changes.

Machine-learning researchers often refer to this as a distribution shift: the statistical patterns present during deployment differ from those seen during training. When that happens, shortcut-based predictions lose their value. [PMC]pmc.ncbi.nlm.nih.govPMCBeyond Distribution Shift: Spurious Features Throughby N Murali · 2023 · Cited by 18 — Deep Neural Networks (DNNs) are prone to learning spurious features that correlate with the label d…

Consider the difference between two types of features:

  • Core features are directly related to the task and remain useful across many situations.
  • Spurious features happen to correlate with the answer in one setting but may disappear in another. [PMC]pmc.ncbi.nlm.nih.govPMCBeyond Distribution Shift: Spurious Features Throughby N Murali · 2023 · Cited by 18 — Deep Neural Networks (DNNs) are prone to learning spurious features that correlate with the label d…

A model that has learned core features can adapt better to new circumstances because those features remain relevant. A model that depends on spurious features effectively has part of its reasoning removed when conditions change. The result may be sudden performance drops, unexpected mistakes, or behaviour that appears irrational to human observers. [arXiv]arxiv.orgarXiv Shortcut Learning in Deep Neural NetworksShortcut Learning in Deep Neural NetworksApril 16, 2020…Published: April 16, 2020

Researchers studying deep neural networks describe shortcuts as decision rules that work on standard benchmarks but fail under more challenging or realistic conditions. In other words, the model succeeds because the shortcut exists, not because it has learned the deeper concept developers intended. [arXiv]arxiv.orgarXiv Shortcut Learning in Deep Neural NetworksShortcut Learning in Deep Neural NetworksApril 16, 2020…Published: April 16, 2020

Shortcuts illustration 2

Examples of failures when settings change

Medical imaging and hospital-specific clues

Medical AI provides some of the clearest examples. Researchers have repeatedly found that diagnostic models can exploit non-medical signals present in training data.

In studies of chest X-rays, systems have sometimes learned information related to image acquisition, hospital identity, scanner characteristics, or other contextual cues instead of focusing solely on disease indicators. Such shortcuts can produce impressive results on familiar datasets yet perform poorly when applied to patients from different hospitals or imaging environments. [PMC+2PMC]pmc.ncbi.nlm.nih.govPMC“Shortcuts” Causing Bias in Radiology Artificial Intelligenceby I Banerjee · 2023 · Cited by 150 — In this review, the authors discuss…

During the COVID-19 pandemic, researchers showed that some AI systems for detecting COVID-19 from chest radiographs relied on shortcuts unrelated to the disease itself. These findings raised concerns that models appearing highly accurate during development might not generalise safely to new clinical settings. [MedRxiv]medrxiv.org2020.09.13.20193565v2.full textAI for radiographic COVID-19 detection selects shortcuts…by AJ DeGrave · 2020 · Cited by 893 — Using state-of-the-art technique…

Image recognition and background dependence

Vision systems often learn environmental cues because those cues are easier to detect than the object itself.

A model trained on photographs where wolves frequently appear in snowy environments may partly associate snow with the wolf label. If later presented with wolves in different surroundings, the system may become less reliable. Similar effects have been observed across many image-recognition tasks involving backgrounds, textures, and scene context. [DeepLearning.ai]deeplearning.aiThe Causes of Shortcut Learning in Neural NetworksJun 3, 2020 — Dataset bias can cause models to focus on spurious correlations rather th…

Research has also shown that image classifiers can become unusually sensitive to texture patterns. Instead of recognising the overall structure of an object, they may rely on surface characteristics that happen to correlate with the label in the training data. [DeepLearning.ai]deeplearning.aiThe Causes of Shortcut Learning in Neural NetworksJun 3, 2020 — Dataset bias can cause models to focus on spurious correlations rather th…

Language models and misleading word associations

Shortcut learning is not limited to images. In language tasks, models can learn that certain words strongly co-occur with particular labels and then over-rely on those words.

For example, a sentiment-analysis system may learn that specific terms frequently appear in positive reviews and use those terms as shortcuts for prediction. When the wording changes, even though the underlying meaning remains similar, performance can deteriorate because the shortcut no longer works. [arXiv]arxiv.orgOpen source on arxiv.org.

Why shortcuts are so attractive to AI systems

An important lesson from the research literature is that shortcuts are often the easiest patterns to learn.

If a simple feature predicts the correct label most of the time, optimisation algorithms naturally gravitate towards it. The model is rewarded for reducing errors quickly, and the shortcut may provide a fast route to high training accuracy. [OpenReview]openreview.netShortcut Learning Through the Lens of Early Training…by N Murali · Cited by 3 — Summary Of The Paper: The paper studies the…

This means that increasing model size or training time does not automatically solve the problem. More powerful models can sometimes become even better at discovering subtle accidental correlations hidden within large datasets. The issue originates in the relationship between the data and the learning objective, not merely in a lack of computational power. [arXiv]arxiv.orgarXiv Shortcut Learning in Deep Neural NetworksShortcut Learning in Deep Neural NetworksApril 16, 2020…Published: April 16, 2020

Shortcuts illustration 3

What this reveals about AI learning

Spurious correlations highlight a fundamental property of modern AI: models learn statistical relationships, not human intentions.

When a dataset contains accidental clues that consistently predict the correct answer, the model has little reason to ignore them. As a result, impressive performance can coexist with fragile understanding. The system may appear intelligent until it encounters a setting where the shortcut no longer applies. [arXiv]arxiv.orgarXiv Shortcut Learning in Deep Neural NetworksShortcut Learning in Deep Neural NetworksApril 16, 2020…Published: April 16, 2020

Understanding shortcut learning therefore helps explain a central challenge in artificial intelligence. The question is not only whether a model can find patterns, but whether it learns patterns that remain valid when the world changes. Spurious correlations matter because they allow a model to succeed for the wrong reason—and success for the wrong reason is often revealed only when the model faces a new environment. [arXiv+2PMC]arxiv.orgarXiv Shortcut Learning in Deep Neural NetworksShortcut Learning in Deep Neural NetworksApril 16, 2020…Published: April 16, 2020

Amazon book picks

Further Reading

Books and field guides related to When AI learns the wrong shortcut. Use these as the next step if you want deeper reading beyond the article.

eBay marketplace picks

Marketplace Samples

Example marketplace items related to this page. Use the search link to explore similar finds on eBay.

Using USA

Endnotes

  1. Source: arxiv.org
    Title: arXiv Shortcut Learning in Deep Neural Networks
    Link: https://arxiv.org/abs/2004.07780
    Source snippet

    Shortcut Learning in Deep Neural NetworksApril 16, 2020...

    Published: April 16, 2020

  2. Source: pmc.ncbi.nlm.nih.gov
    Title: PMCBeyond Distribution Shift: Spurious Features Through
    Link: https://pmc.ncbi.nlm.nih.gov/articles/PMC11029547/
    Source snippet

    by N Murali · 2023 · Cited by 18 — Deep Neural Networks (DNNs) are prone to learning spurious features that correlate with the label d...

  3. Source: deeplearning.ai
    Link: https://www.deeplearning.ai/the-batch/when-models-take-shortcuts/
    Source snippet

    The Causes of Shortcut Learning in Neural NetworksJun 3, 2020 — Dataset bias can cause models to focus on spurious correlations rather th...

  4. Source: ijcai.org
    Link: https://www.ijcai.org/proceedings/2023/0560.pdf
    Source snippet

    s often take dataset biases as a shortcut to make [decisions]({{ 'decisions/' | relative_url }}) rather than understand tasks, lead- ing to...Read more...

  5. Source: medrxiv.org
    Title: 2020.09.13.20193565v2.full text
    Link: https://www.medrxiv.org/content/10.1101/2020.09.13.20193565v2.full-text
    Source snippet

    AI for radiographic COVID-19 detection selects shortcuts...by AJ DeGrave · 2020 · Cited by 893 — Using state-of-the-art technique...

  6. Source: arxiv.org
    Link: https://arxiv.org/html/2412.05152v1
    Source snippet

    Navigating Shortcuts, Spurious Correlations, and...6 Dec 2024 — Shortcuts, also described as Clever Hans behavior, spurious correlations...

  7. Source: openreview.net
    Link: https://openreview.net/forum?id=5wa-ueGGI33
    Source snippet

    Shortcut Learning Through the Lens of Early Training...by N Murali · Cited by 3 — Summary Of The Paper: The paper studies the...

  8. Source: arxiv.org
    Link: https://arxiv.org/abs/2205.12593

  9. Source: pmc.ncbi.nlm.nih.gov
    Link: https://pmc.ncbi.nlm.nih.gov/articles/PMC12344020/
    Source snippet

    and Mitigating the Clever Hans Effect in Medical...by C Vásquez-Venegas · 2024 · Cited by 23 — The Clever Hans effect occurs when machin...

  10. Source: arxiv.org
    Link: https://arxiv.org/pdf/2004.07780
    Source snippet

    [Deep learning]({{ 'deep-learning/' | relative_url }}) has triggered the current rise of artificial intelligence and is the workhorse of today's machine intelligence.Read more...

  11. Source: openreview.net
    Link: https://openreview.net/forum?id=3UMzxqDcpY
    Source snippet

    ible to detect shortcut learning and locate where the shortcut is happening in a neural network.Read more...

  12. Source: youtube.com
    Title: Shortcut Learning in Deep Neural Networks
    Link: http://www.youtube.com/watch?v=D-eg7k8YSfs
    Source snippet

    Shortcut Learning - A generalization problem in deep neural networks...

  13. Source: youtube.com
    Title: Shortcut Learning
    Link: http://www.youtube.com/watch?v=rntdFo9y3L4
    Source snippet

    in AI (Deep Learning) - An Overview...

  14. Source: pmc.ncbi.nlm.nih.gov
    Link: https://pmc.ncbi.nlm.nih.gov/articles/PMC11192466/
    Source snippet

    PMC“Shortcuts” Causing Bias in Radiology Artificial Intelligenceby I Banerjee · 2023 · Cited by 150 — In this review, the authors discuss...

  15. Source: dictionary.cambridge.org
    Link: https://dictionary.cambridge.org/dictionary/english-chinese-traditional/spurious
    Source snippet

    in Traditional Chinese - Cambridge Dictionaryfalse and not what it appears to be, or (of reasons and judgments) based on something that h...

Additional References

  1. Source: dictionary.com
    Link: https://www.dictionary.com/browse/spurious
    Source snippet

    SPURIOUS Definition & Meaningnot genuine, authentic, or true; not from the claimed, pretended, or proper source; counterfeit. Synonyms: d...

  2. Source: en.wiktionary.org
    Link: https://en.wiktionary.org/wiki/spurious
    Source snippet

    spurious (comparative more spurious, superlative most spurious) False, not authentic, not genuine. Extraneous, stray; not relevant or wanted...

  3. Source: cds.ismrm.org
    Link: https://cds.ismrm.org/protected/23MProceedings/PDFfiles/0154_M9imKVQZg.html
    Source snippet

    Nature Machine Intelligence... (B) Proposed: By interrupting the shortcut, learning a spurious-correlation is avoided.Read more...

  4. Source: github.com
    Link: https://github.com/rgeirhos/shortcut-perspective
    Source snippet

    rted in the perspective article "Shortcut Learning in Deep Neural Networks"Read more...

  5. Source: youtube.com
    Title: Spurious Correlations and the Rise of Causal AI: A Guide for Investors
    Link: http://www.youtube.com/watch?v=OOUzdEN_Dew
    Source snippet

    This Shortcut Learning in Deep Neural Networks video outlines how artificial intelligence models pick up on accidental text or background...

  6. Source: linkedin.com
    Link: https://www.linkedin.com/pulse/unveiling-pitfalls-shortcut-learning-deep-neural-networks-tuiamat-icoae
    Source snippet

    nding factors that are not representative of the true task at...Read more...

  7. Source: dl.acm.org
    Link: https://dl.acm.org/doi/10.1145/3626772.3657729
    Source snippet

    acm.orgCOMI: COrrect and MItigate Shortcut Learning Behavior in...Jul 11, 2024 — In this paper, we propose COMI, a novel method to COrre...

  8. Source: techxplore.com
    Title: 2020 12 exploring notion shortcut deep neural
    Link: https://techxplore.com/news/2020-12-exploring-notion-shortcut-deep-neural.html
    Source snippet

    Exploring the notion of shortcut learning in deep neural...23 Dec 2020 — Geirhos and Michaelis believed that shortcut learning, the phen...

  9. Source: reddit.com
    Link: https://www.reddit.com/r/MachineLearning/comments/hxt85n/r_understanding_neural_network_shortcut_learning/
    Source snippet

    t discriminating consistent clues and inconsistent clues.Read more...

  10. Source: scispace.com
    Title: Shortcut learning in deep neural networks
    Link: https://scispace.com/papers/shortcut-learning-in-deep-neural-networks-1sq5hzzuwt?citations_page=63
    Source snippet

    Robert GeirhosIn this Perspective we seek to distil how many of deep learning's failures can be seen as different symptoms of the same un...

Topic Tree

Follow this branch

Parent topic

Training data Why the data teaches the model

Related pages 2