top | item 40862944

ogarten | 1 year ago

How does this surprise anyone?

Medical data for AI training is almost always sourced in some more or less shady country because those countries lack any privacy regulations. It's then annotated by a horde of cheap workers who may or may not have advanced medical training.

Even "normal medicine" is extremely biased towards male patients who fit the norm, which is why a lot of things are not detected early enough in women or in people who do not match that norm.

Next thing: Doctors often think that their annotations are the absolute gold standard, but they don't necessarily know everything that is in an X-ray or an MRI.

A few years ago we tried to build synthetic data for this exact purpose by simulating medical images for 3D body models with different diseases. Nobody we talked to cared about it, because "we have good data".

aprilthird2021|1 year ago

Yep, you nailed it. You really don't have to think hard about why AI which only learns from what we feed it and what it can access has gaps and biases more pronounced than the real world. AI lives in the internet world; it's trained on horrible cesspools of anonymous text like 4chan and reddit. No wonder it will be biased. If you only tried to feed it sanitized data, you wouldn't have enough to get the results we get now.

DrScientist|1 year ago

> You really don't have to think hard about why AI which only learns from what we feed it

Sadly I'd say that people are no different.

> it's trained on horrible cesspools of ....

So it's really not the future of AI we should be worrying about...

resource_waste|1 year ago

I have been quite anti-HIPAA since realizing how 'privacy' was the excuse to stunt science.

My conspiracy theory: With massive medical data, ML/AI would have been 'discovered'/built sooner. Limiting the data makes it so only a few people can be specialists under the supervision of medical cartels.

KingOfCoders|1 year ago

Great, where can I find your medical data on the web? Care to give a URL? Would be perfect to include your salary.

davedx|1 year ago

I know GPT-4o can diagnose medical images. Is their model likely to be using the same kind of datasets as these models for medical systems?