Shortcut Learning in Medical Image Segmentation
Authors:
Manxi Lin,
Nina Weng,
Kamil Mikolaj,
Zahra Bashir,
Morten Bo Søndergaard Svendsen,
Martin Tolsgaard,
Anders Nymark Christensen,
Aasa Feragen
Abstract:
Shortcut learning is a phenomenon where machine learning models prioritize learning simple, potentially misleading cues from data that do not generalize well beyond the training set. While existing research primarily investigates this in the realm of image classification, this study extends the exploration of shortcut learning into medical image segmentation. We demonstrate that clinical annotatio…
▽ More
Shortcut learning is a phenomenon where machine learning models prioritize learning simple, potentially misleading cues from data that do not generalize well beyond the training set. While existing research primarily investigates this in the realm of image classification, this study extends the exploration of shortcut learning into medical image segmentation. We demonstrate that clinical annotations such as calipers, and the combination of zero-padded convolutions and center-cropped training sets in the dataset can inadvertently serve as shortcuts, impacting segmentation accuracy. We identify and evaluate the shortcut learning on two different but common medical image segmentation tasks. In addition, we suggest strategies to mitigate the influence of shortcut learning and improve the generalizability of the segmentation models. By uncovering the presence and implications of shortcuts in medical image segmentation, we provide insights and methodologies for evaluating and overcoming this pervasive challenge and call for attention in the community for shortcuts in segmentation. Our code is public at https://github.com/nina-weng/shortcut_skinseg .
△ Less
Submitted 27 June, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
Generalisability of fetal ultrasound deep learning models to low-resource imaging settings in five African countries
Authors:
Carla Sendra-Balcells,
Víctor M. Campello,
Jordina Torrents-Barrena,
Yahya Ali Ahmed,
Mustafa Elattar,
Benard Ohene Botwe,
Pempho Nyangulu,
William Stones,
Mohammed Ammar,
Lamya Nawal Benamer,
Harriet Nalubega Kisembo,
Senai Goitom Sereke,
Sikolia Z. Wanyonyi,
Marleen Temmerman,
Eduard Gratacós,
Elisenda Bonet,
Elisenda Eixarch,
Kamil Mikolaj,
Martin Grønnebæk Tolsgaard,
Karim Lekadir
Abstract:
Most artificial intelligence (AI) research have concentrated in high-income countries, where imaging data, IT infrastructures and clinical expertise are plentiful. However, slower progress has been made in limited-resource environments where medical imaging is needed. For example, in Sub-Saharan Africa the rate of perinatal mortality is very high due to limited access to antenatal screening. In th…
▽ More
Most artificial intelligence (AI) research have concentrated in high-income countries, where imaging data, IT infrastructures and clinical expertise are plentiful. However, slower progress has been made in limited-resource environments where medical imaging is needed. For example, in Sub-Saharan Africa the rate of perinatal mortality is very high due to limited access to antenatal screening. In these countries, AI models could be implemented to help clinicians acquire fetal ultrasound planes for diagnosis of fetal abnormalities. So far, deep learning models have been proposed to identify standard fetal planes, but there is no evidence of their ability to generalise in centres with limited access to high-end ultrasound equipment and data. This work investigates different strategies to reduce the domain-shift effect for a fetal plane classification model trained on a high-resource clinical centre and transferred to a new low-resource centre. To that end, a classifier trained with 1,792 patients from Spain is first evaluated on a new centre in Denmark in optimal conditions with 1,008 patients and is later optimised to reach the same performance in five African centres (Egypt, Algeria, Uganda, Ghana and Malawi) with 25 patients each. The results show that a transfer learning approach can be a solution to integrate small-size African samples with existing large-scale databases in developed countries. In particular, the model can be re-aligned and optimised to boost the performance on African populations by increasing the recall to $0.92\pm0.04$ and at the same time maintaining a high precision across centres. This framework shows promise for building new AI models generalisable across clinical centres with limited data acquired in challenging and heterogeneous conditions and calls for further research to develop new solutions for usability of AI in countries with less resources.
△ Less
Submitted 14 February, 2023; v1 submitted 20 September, 2022;
originally announced September 2022.