AI can inexplicably detect race when we don’t want it to

What’s happening? AI models used to examine x-rays can identify a patient’s race, but how the algorithms do so cannot be determined, according to a multi-institution study. Researchers tested five radiology research x-ray image types, including mammograms. Despite racial identity not being a biological category, the algorithms detected racial groups with an accuracy of between 80% and 99%. Anatomical or visual features, age, sex, or specific diagnoses were ruled out as classifying factors. Further research into the issue is required, as is educating patients, the authors said. Previous studies have highlighted medical algorithm bias in care delivery, among other areas. (Wired)

But… how? The system discussed above is a type of machine learning which – in a similar way to humans – ingests data and forms an understanding of the connections between different pieces of information.

Somehow – in a way we can’t quite see or understand – the x-rays that these models are being fed have information that allows the AI to make connections and accurately guess the race of patients.

Is that really such a bad thing? On its own, the ability to identify race through machine learning wouldn’t be particularly noteworthy. However, this model wasn’t built to determine race, it is used to identify potentially dangerous health issues (that have nothing to do with race!). What makes this story so concerning is – now that the model has this racial information – how could that knowledge begin to distort its diagnoses?

If a model begins to notice race (where it isn’t necessary), it might begin to make recommendations based on past cases it’s learned from, where racial bias has very much played a part in how people were diagnosed. Allowing it to do that would not only perpetuate old biases and disparities, but it might actually make them more pronounced.

Machine learning algorithms have already been shown to be fallible in this area. In 2019 an algorithm widely used to prioritise care for seriously ill patients was shown to disadvantaged Black patients, while in 2020 one algorithm consistently assigned lower risk scores to Black patients with kidney disease, downplaying the seriousness of their disease. Another, trained to flag pneumonia and other chest conditions, performed differently for people of different sexes, ages, races, and types of medical insurance.

Lateral thought – Health care and medicine aren’t the only areas where we need to be vigilant about machine learning making connections to irrelevant demographic data. AI models are increasingly replacing human judgement in industries including insurance pricing, credit checks, prison sentencing, risk assessment and many more. If such models are picking up on personal data without being asked to, it could compromise analyses that are meant to be objective.

For example, an algorithm used to credit check individuals for a loan might determine that person is Black, and decide (based on a long history of banks denying Black people financial services) to deny them that loan. If a model was set to assess climate risk for a neighbourhood, would it first try to determine whether its residents were high income or low income to then inform its response? Would an algorithm directing medical services prioritise wealthy white people because it worked out they were likely to live longer?

Share This Post

Cookie	Duration	Description
__hssrc	session	This cookie is set by Hubspot. According to their documentation, whenever HubSpot changes the session cookie, this cookie is also set to determine if the visitor has restarted their browser. If this cookie does not exist when HubSpot manages cookies, it is considered a new session.
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-non-necessary	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non-necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__hssc	30 minutes	This cookie is set by HubSpot. The purpose of the cookie is to keep track of sessions. This is used to determine if HubSpot should increment the session number and timestamps in the __hstc cookie. It contains the domain, viewCount (increments each pageView in a session), and session start timestamp.
_gat	1 minute	This cookies is installed by Google Universal Analytics to throttle the request rate to limit the colllection of data on high traffic sites.
bcookie	2 years	This cookie is set by linkedIn. The purpose of the cookie is to enable LinkedIn functionalities on the page.
lang	session	This cookie is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	1 day	This cookie is set by LinkedIn and used for routing.

Cookie	Duration	Description
__hstc	1 year 24 days	This cookie is set by Hubspot and is used for tracking visitors. It contains the domain, utk, initial timestamp (first visit), last timestamp (last visit), current timestamp (this visit), and session number (increments for each subsequent session).
_ga	2 years	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_gat_gtag_UA_39710400_12	1 minute	This cookie is set by Google and is used to distinguish users.
_gid	1 day	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visted in an anonymous form.
_lr_hb_-trmgqt%2Fcuration-website	1 day
_lr_tabs_-trmgqt%2Fcuration-website	1 day
_lr_uf_-trmgqt	session
ajs_anonymous_id	1 year	This cookie is set by Segment.io to check the number of new and returning visitors to the website.
ajs_user_id	never	The cookie is set by Segment.io and is used to analyze how you use the website
AMP_TOKEN		This cookie is set by Google Analytics - This cookie contains a token that can be used to retrieve a Client ID from AMP Client ID service. Other possible values indicate opt-out, inflight request or an error retrieving a Client ID from AMP Client ID service.
AnalyticsSyncHistory	1 month
hubspotutk	1 year 24 days	This cookie is used by HubSpot to keep track of the visitors to the website. This cookie is passed to Hubspot on form submission and used when deduplicating contacts.a
li_gc	2 years

Cookie	Duration	Description
_fbp	3 months	This cookie is set by Facebook to deliver advertisement when they are on Facebook or a digital platform powered by Facebook advertising after visiting this website.
bscookie	2 years	This cookie is a browser ID cookie set by Linked share Buttons and ad tags.
UserMatchHistory	1 month

AI can inexplicably detect race when we don’t want it to

You might also like

IBAT launches new DLE lithium extraction technology in commercialisation move

Microsoft and Occidental Petroleum sign record carbon credit deal to offset data centre and AI-driven emissions

Shell faces $2bn impairment charge over biofuel backtrack