Within Health Proxy
When equal risk scores hide unequal illness
Patients with the same algorithmic score could have very different illness burdens, revealing hidden racial disparity in care allocation.
On this page
- What equal scores revealed about sickness
- How under ranking affected programme eligibility
- Why group level checks matter before deployment
Page outline Jump by section
Introduction
One of the most important lessons in understanding artificial intelligence is that equal scores do not necessarily mean equal outcomes. In the widely discussed healthcare risk-scoring case, researchers discovered that Black patients and White patients who received the same algorithmic risk score often had very different levels of illness. The score appeared neutral, but patients with identical ratings were not equally sick. Black patients were typically carrying a substantially greater disease burden than White patients at the same predicted risk level. This finding revealed how an AI system can seem accurate overall while still producing unequal consequences for different groups. [Chicago Booth]chicagobooth.eduChicago BoothDissecting racial bias in an algorithm used to manage the health of populations - Tolan Center | Chicago Booth…
What equal scores revealed about sickness
The most striking evidence emerged when researchers stopped looking at overall prediction accuracy and instead compared patients who received the same risk score.
The healthcare algorithm was designed to identify patients who might benefit from additional care-management services. When researchers examined people with identical scores, they found that Black patients consistently had more chronic illness, more severe health conditions, and more signs of uncontrolled disease than White patients assigned the same level of risk. [Data 6]data6.orgJune 24, 2021…
This observation mattered because a risk score is supposed to summarise underlying need. If two patients receive the same score, decision-makers generally assume they have roughly similar health risks. The study showed that this assumption was false. The algorithm’s rankings systematically understated the health needs of many Black patients. [PubMed]pubmed.ncbi.nlm.nih.govDissecting racial bias in an algorithm used to manage the health of populations - PubMed…
Researchers measured illness burden in several ways, including the number of chronic conditions and broader indicators of health status. Across these measures, the pattern remained consistent: equal algorithmic scores did not correspond to equal levels of sickness. Black patients had to be substantially sicker before receiving the same risk rating as White patients. [Data 6]data6.orgJune 24, 2021…
This finding is especially important because the disparity was not obvious from conventional performance statistics. The model could appear successful when evaluated against its training target while still producing unequal rankings when used to guide care decisions. [Chicago Booth]chicagobooth.eduChicago BoothDissecting racial bias in an algorithm used to manage the health of populations - Tolan Center | Chicago Booth…
How under-ranking affected programme eligibility
The unequal scoring translated directly into access to support programmes.
Many health systems use risk thresholds to determine who receives additional monitoring, preventive interventions, case management, or specialised care coordination. Patients above a certain score are enrolled; those below it are not. Because Black patients were often assigned lower scores than their actual illness burden justified, many were less likely to qualify for these programmes. [Chicago Booth]chicagobooth.eduChicago BoothDissecting racial bias in an algorithm used to manage the health of populations - Tolan Center | Chicago Booth…
The researchers estimated that if the bias were removed, the share of Black patients identified for extra care would rise dramatically—from 17.7% to 46.5%. This was not a small statistical adjustment. It represented a major change in who would receive healthcare resources and support. [Chicago Booth]chicagobooth.eduChicago BoothDissecting racial bias in an algorithm used to manage the health of populations - Tolan Center | Chicago Booth…
The practical consequence was that two patients with similar medical needs could be treated differently by the allocation system. A White patient might cross the eligibility threshold while a Black patient with comparable or greater illness remained below it. The algorithm therefore influenced not only rankings on a screen but also real-world access to healthcare services. [Chicago Booth]chicagobooth.eduChicago BoothDissecting racial bias in an algorithm used to manage the health of populations - Tolan Center | Chicago Booth…
The pattern reflected a broader structural problem. Historical healthcare spending on Black patients was often lower than spending on White patients with similar health conditions. Because the algorithm learned from spending data, it inherited those patterns and reproduced them in its predictions. [Chicago Booth+2PubMed]chicagobooth.eduChicago BoothDissecting racial bias in an algorithm used to manage the health of populations - Tolan Center | Chicago Booth…
Why group-level checks matter before deployment
The equal-score finding became influential because it demonstrated a limitation of standard AI evaluation.
A model can perform well on average while still treating groups differently. If evaluators only examine aggregate accuracy, they may miss important disparities hidden inside the rankings. The healthcare case showed that an algorithm can correctly predict its chosen target yet still fail at the task decision-makers actually care about. [Chicago Booth]chicagobooth.eduChicago BoothDissecting racial bias in an algorithm used to manage the health of populations - Tolan Center | Chicago Booth…
Group-level analysis helped expose the problem. Instead of asking whether the model predicted future costs accurately, researchers asked a different question: when Black and White patients receive the same score, do they have similar levels of illness? The answer was clearly no. That comparison revealed a disparity invisible in many conventional performance reports. [Data 6]data6.orgJune 24, 2021…
The lesson extends beyond healthcare. Whenever an AI system ranks people for opportunities, services, or interventions, developers must examine how scores relate to real outcomes across different populations. Equal numerical outputs do not automatically imply equal treatment. A score may encode hidden assumptions inherited from the data used to train the model. [Chicago Booth]chicagobooth.eduChicago BoothDissecting racial bias in an algorithm used to manage the health of populations - Tolan Center | Chicago Booth…
For students of artificial intelligence, this case is a reminder that fairness questions often emerge not from the algorithm’s mathematics alone but from the relationship between the score and the reality it is meant to represent. The discovery that Black patients were sicker at the same risk score became one of the clearest demonstrations that AI systems should be evaluated not only for predictive accuracy but also for how their rankings affect different groups in practice. [Chicago Booth]chicagobooth.eduChicago BoothDissecting racial bias in an algorithm used to manage the health of populations - Tolan Center | Chicago Booth…
Amazon book picks
Further Reading
Books and field guides related to When equal risk scores hide unequal illness. Use these as the next step if you want deeper reading beyond the article.
Weapons of Math Destruction
Explains why seemingly neutral scores can produce unequal consequences.
Endnotes
-
Source: chicagobooth.edu
Link: https://www.chicagobooth.edu/research/tolan/research/2019/dissecting-racial-bias-in-an-algorithm-used-to-manage-the-health-of-populationsSource snippet
Chicago BoothDissecting racial bias in an algorithm used to manage the health of populations - Tolan Center | Chicago Booth...
-
Source: pubmed.ncbi.nlm.nih.gov
Link: https://pubmed.ncbi.nlm.nih.gov/31649194/Source snippet
Dissecting racial bias in an algorithm used to manage the health of populations - PubMed...
-
Source: data6.org
Link: https://data6.org/su22/assignments/DissectingRacialBias.pdfSource snippet
June 24, 2021...
Published: June 24, 2021
-
Source: ouci.dntb.gov.ua
Title: Dissecting racial bias in an algorithm used to manage the health of populations
Link: https://ouci.dntb.gov.ua/en/works/7P2G12Gl/
Additional References
-
Source: reddit.com
Title: Millions of black people affected by racial bias in health-care algorithms
Link: https://www.reddit.com/r/autotldr/comments/dn232jSource snippet
Millions of black people affected by racial bias in health-care algorithms...
-
Source: youtube.com
Title: Dissecting Racial Bias in an Algorithm that Guides Health Decisions for Millions
Link: https://www.youtube.com/watch?v=y6eo0FZIqjkSource snippet
Dissecting Algorithmic Bias | Ziad Obermeyer | AI FOR GOOD DISCOVERY...
-
Source: youtube.com
Title: Dissecting Algorithmic Bias | Ziad Obermeyer | AI FOR GOOD DISCOVERY
Link: https://www.youtube.com/watch?v=U5MlyFsMi-ESource snippet
Keynote Presentation: Dissecting Algorithmic Bias...
-
Source: youtube.com
Title: The Double-Edged Sword of AI, with Dr. Ziad Obermeyer
Link: https://www.youtube.com/watch?v=c5KcSNeGXOMSource snippet
The Double-Edged Sword of AI, with Dr. Ziad Obermeyer - YouTube...
-
Source: youtube.com
Title: Keynote Presentation: Dissecting Algorithmic Bias
Link: https://www.youtube.com/watch?v=JfKYO1W4uuASource snippet
The Double-Edged Sword of AI, with Dr. Ziad Obermeyer...
Topic Tree


