
> We know that various features visible in medical images correlate with race, eg breast density, bone density, etc.

Can those features still be picked out when the image is pixellated down to 8x8, as illustrated in the article? That seems unlikely.

I wonder if the AI is just cheating, as these models sometimes inadvertently do. https://techcrunch.com/2018/12/31/this-clever-ai-hid-data-fr...

> In some early results, the agent was doing well — suspiciously well. What tipped the team off was that, when the agent reconstructed aerial photographs from its street maps, there were lots of details that didn’t seem to be on the latter at all. For instance, skylights on a roof that were eliminated in the process of creating the street map would magically reappear when they asked the agent to do the reverse process...



I suspect there must be some confounder here, like the positioning used for the CXRs correlating with race via the imaging methodology used in a particular region or hospital.

Seems the most likely explanation for it still working even when pixellated down to 8x8?
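
For anyone who wants to poke at this, the 8x8 test itself is easy to reproduce. A minimal sketch, with a hypothetical input file and an untrained ResNet standing in for the paper's trained classifier (assumes torch, torchvision, and Pillow):

    import torch
    from PIL import Image
    from torchvision import models, transforms

    def pixelate(img, blocks=8):
        # downsample to blocks x blocks, then back up to the original size
        small = img.resize((blocks, blocks), Image.BILINEAR)
        return small.resize(img.size, Image.NEAREST)

    img = Image.open("xray.png").convert("RGB")   # hypothetical input CXR
    degraded = pixelate(img, blocks=8)

    # untrained ResNet standing in for the paper's trained classifier
    model = models.resnet18(num_classes=4).eval()
    x = transforms.ToTensor()(degraded.resize((224, 224)))
    with torch.no_grad():
        probs = torch.softmax(model(x.unsqueeze(0)), dim=1)
    print(probs)  # with a real trained model, confident predictions here are the surprise

If a properly trained model still predicts race confidently on input degraded this far, whatever signal it uses has to live in very coarse, low-frequency structure.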


That was exactly my thinking. I wouldn't publish anything until the pixellated versions were better understood.


Well, isn’t the point of publishing to get help figuring it out from other researchers in the field? I agree it’s very likely that there’s some kind of explainable trick the AI is using, but there’s no guarantee it’s an easy trick that the authors could have figured out.


I'd warm up to that concept if the article were: "We don't know what in the hell is going on here. Here's our source code and data set of x-rays and race. What do you think?"

It could be that in the realm of machine learning, most of what is going on is people turning random knobs on a big machine and getting mysterious results. It's the birth of science without understanding.


That's precisely what the researchers are saying. In the underlying paper, they conclude that "this capability is extremely difficult to isolate or mitigate", call for "further investigation and research into the human-hidden but model-decipherable information", and suggest medical imaging people should "consider the use of deep learning models with extreme caution" until future research produces a better understanding of what's happening.


They always call for 'further investigation'.

Looking at this: https://arxiv.org/pdf/2107.10356.pdf

My general impression (no more than that) is of a whole bunch of people crowding into a paper. The paper mostly applies trivial image processing functions (the sort of thing sketched below) and observes how some software they don't understand responds. The main aim is pearl-clutching about 'bias' rather than any kind of understanding. God knows what they're going to do when any medical exam includes some kind of deep dive into the patient's genetics.

No surprises. It's the nature of the era.
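
Concretely, the kind of image-processing experiment in the paper looks roughly like this sketch. The file name is a hypothetical stand-in, and the classifier step is only indicated in a comment (assumes numpy, scipy, and Pillow):

    import numpy as np
    from PIL import Image
    from scipy.ndimage import gaussian_filter

    img = np.asarray(Image.open("xray.png").convert("L"), dtype=np.float32)

    low_pass = gaussian_filter(img, sigma=4)   # keeps only coarse structure
    high_pass = img - low_pass                 # keeps only fine detail / edges

    for name, arr in (("lpf", low_pass), ("hpf", high_pass)):
        out = arr - arr.min()
        out = (255 * out / max(float(out.max()), 1e-6)).astype(np.uint8)
        Image.fromarray(out).save(f"xray_{name}.png")
    # then run each filtered version through the trained classifier and
    # compare the race-prediction AUC against the unfiltered baseline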


That was my initial reaction as well, but they validate on datasets separate from the ones used for training, which makes this unlikely.

The performance despite degradation may be the same phenomenon that gives rise to adversarial examples that are indistinguishable to human eyes; i.e., we know that neural nets are highly sensitive to visually imperceptible differences.
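
For anyone unfamiliar, that sensitivity is easy to demonstrate with the classic one-step FGSM attack (Goodfellow et al.). A minimal sketch with a toy stand-in model and a random "image", not the paper's setup:

    import torch
    import torch.nn.functional as F

    def fgsm(model, x, y, eps=2/255):
        # one-step fast gradient sign method: nudge every pixel by +/- eps
        # in the direction that increases the loss
        x = x.clone().detach().requires_grad_(True)
        F.cross_entropy(model(x), y).backward()
        return (x + eps * x.grad.sign()).clamp(0, 1).detach()

    # toy stand-in classifier and image; the effect is far more dramatic
    # with a real trained network
    model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 32 * 32, 10))
    x = torch.rand(1, 3, 32, 32)
    y = model(x).argmax(dim=1)          # treat the current prediction as the label

    x_adv = fgsm(model, x, y)
    print((x_adv - x).abs().max())      # per-pixel change is at most eps (~0.008)
    print(model(x).argmax(dim=1), model(x_adv).argmax(dim=1))  # labels may differ

The perturbation is capped at eps per pixel, well below what a human can see on a real image, yet it is constructed to push the loss uphill as fast as possible.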



