Unmasking Bias in AI: A Systematic Review of Bias Detection and Mitigation Strategies in Electronic Health Record-based Models

Read original: arXiv:2310.19917 - Published 7/2/2024 by Feng Chen, Liqin Wang, Julie Hong, Jiaqi Jiang, Li Zhou

🔎

Overview

This study reviews methods to detect and mitigate diverse forms of bias in AI models developed using electronic health records (EHRs).
The researchers conducted a systematic review following the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines.
The review identified key biases, outlined strategies for detecting and mitigating bias throughout the AI model development process, and analyzed metrics for bias assessment.

Plain English Explanation

Healthcare systems are increasingly using artificial intelligence (AI) models developed with electronic health records (EHRs) to improve decision-making and patient outcomes. However, these AI models can sometimes exhibit biases that can worsen healthcare disparities. This study examines different types of biases that can arise in AI models built using EHR data and explores methods to detect and address these biases.

The researchers systematically reviewed scientific literature to identify the key biases, strategies for detecting and mitigating them, and metrics for assessing bias. They found that AI models in healthcare can be affected by various types of biases, including algorithmic bias, confounding bias, implicit bias, measurement bias, selection bias, and temporal bias.

To address these biases, the researchers found that data collection and preprocessing techniques like resampling, reweighting, and transformation can be effective. They also identified the need for standardized, generalizable, and interpretable methodologies to foster the creation of ethical AI systems that promote fairness and equity in healthcare.

Technical Explanation

The researchers conducted a systematic review following the PRISMA guidelines to identify and analyze methods for detecting and mitigating bias in AI models developed using EHR data. They searched PubMed, Web of Science, and IEEE databases for relevant articles published between January 1, 2010, and December 17, 2023, and identified 20 articles that met their criteria.

The review revealed six major types of bias that can affect AI models in healthcare settings: algorithmic bias, confounding bias, implicit bias, measurement bias, selection bias, and temporal bias. The AI models were primarily developed for predictive tasks, such as disease diagnosis or risk prediction.

The researchers found that four of the studies focused on the detection of implicit and algorithmic biases, employing fairness metrics like statistical parity, equal opportunity, and predictive equity. Additionally, 60 of the studies proposed various strategies for mitigating biases, especially targeting implicit and selection biases. These strategies were evaluated through both performance (e.g., accuracy, AUROC) and fairness metrics, and predominantly involved data collection and preprocessing techniques like resampling, reweighting, and transformation.

Critical Analysis

The review highlights the varied and evolving nature of strategies to address bias in EHR-based AI models. However, the researchers note that there is an urgent need for the establishment of standardized, generalizable, and interpretable methodologies to foster the creation of ethical AI systems that promote fairness and equity in healthcare.

One potential limitation of the study is that it may not have captured all relevant research, as the review was limited to articles published between January 1, 2010, and December 17, 2023. Additionally, the review focused on a relatively small set of 20 articles, which may not fully represent the breadth of work in this rapidly evolving field.

Furthermore, the review did not deeply explore the role of interdisciplinary expertise in advancing equitable and explainable AI, which could be a crucial factor in developing effective bias mitigation strategies. Collaboration between AI researchers, healthcare professionals, ethicists, and other relevant stakeholders may be necessary to address the complex challenges of bias in healthcare AI.

Conclusion

This systematic review highlights the need for robust and systematic approaches to detecting and mitigating bias in AI models developed using EHR data. The researchers identified key types of bias and various strategies for addressing them, but emphasized the urgent need for standardized, generalizable, and interpretable methodologies to ensure the development of ethical AI systems that promote fairness and equity in healthcare. Continued research and cross-disciplinary collaboration will be crucial to realizing the transformative potential of AI in healthcare while safeguarding against the risks of bias and healthcare disparities.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🔎

Unmasking Bias in AI: A Systematic Review of Bias Detection and Mitigation Strategies in Electronic Health Record-based Models

Feng Chen, Liqin Wang, Julie Hong, Jiaqi Jiang, Li Zhou

Objectives: Leveraging artificial intelligence (AI) in conjunction with electronic health records (EHRs) holds transformative potential to improve healthcare. Yet, addressing bias in AI, which risks worsening healthcare disparities, cannot be overlooked. This study reviews methods to detect and mitigate diverse forms of bias in AI models developed using EHR data. Methods: We conducted a systematic review following the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines, analyzing articles from PubMed, Web of Science, and IEEE published between January 1, 2010, and Dec 17, 2023. The review identified key biases, outlined strategies for detecting and mitigating bias throughout the AI model development process, and analyzed metrics for bias assessment. Results: Of the 450 articles retrieved, 20 met our criteria, revealing six major bias types: algorithmic, confounding, implicit, measurement, selection, and temporal. The AI models were primarily developed for predictive tasks in healthcare settings. Four studies concentrated on the detection of implicit and algorithmic biases employing fairness metrics like statistical parity, equal opportunity, and predictive equity. Sixty proposed various strategies for mitigating biases, especially targeting implicit and selection biases. These strategies, evaluated through both performance (e.g., accuracy, AUROC) and fairness metrics, predominantly involved data collection and preprocessing techniques like resampling, reweighting, and transformation. Discussion: This review highlights the varied and evolving nature of strategies to address bias in EHR-based AI models, emphasizing the urgent needs for the establishment of standardized, generalizable, and interpretable methodologies to foster the creation of ethical AI systems that promote fairness and equity in healthcare.

7/2/2024

✅

AI-Driven Healthcare: A Survey on Ensuring Fairness and Mitigating Bias

Sribala Vidyadhari Chinta, Zichong Wang, Xingyu Zhang, Thang Doan Viet, Ayesha Kashif, Monique Antoinette Smith, Wenbin Zhang

Artificial intelligence (AI) is rapidly advancing in healthcare, enhancing the efficiency and effectiveness of services across various specialties, including cardiology, ophthalmology, dermatology, emergency medicine, etc. AI applications have significantly improved diagnostic accuracy, treatment personalization, and patient outcome predictions by leveraging technologies such as machine learning, neural networks, and natural language processing. However, these advancements also introduce substantial ethical and fairness challenges, particularly related to biases in data and algorithms. These biases can lead to disparities in healthcare delivery, affecting diagnostic accuracy and treatment outcomes across different demographic groups. This survey paper examines the integration of AI in healthcare, highlighting critical challenges related to bias and exploring strategies for mitigation. We emphasize the necessity of diverse datasets, fairness-aware algorithms, and regulatory frameworks to ensure equitable healthcare delivery. The paper concludes with recommendations for future research, advocating for interdisciplinary approaches, transparency in AI decision-making, and the development of innovative and inclusive AI applications.

7/30/2024

🌐

Towards objective and systematic evaluation of bias in artificial intelligence for medical imaging

Emma A. M. Stanley, Raissa Souza, Anthony Winder, Vedant Gulve, Kimberly Amador, Matthias Wilms, Nils D. Forkert

Artificial intelligence (AI) models trained using medical images for clinical tasks often exhibit bias in the form of disparities in performance between subgroups. Since not all sources of biases in real-world medical imaging data are easily identifiable, it is challenging to comprehensively assess how those biases are encoded in models, and how capable bias mitigation methods are at ameliorating performance disparities. In this article, we introduce a novel analysis framework for systematically and objectively investigating the impact of biases in medical images on AI models. We developed and tested this framework for conducting controlled in silico trials to assess bias in medical imaging AI using a tool for generating synthetic magnetic resonance images with known disease effects and sources of bias. The feasibility is showcased by using three counterfactual bias scenarios to measure the impact of simulated bias effects on a convolutional neural network (CNN) classifier and the efficacy of three bias mitigation strategies. The analysis revealed that the simulated biases resulted in expected subgroup performance disparities when the CNN was trained on the synthetic datasets. Moreover, reweighing was identified as the most successful bias mitigation strategy for this setup, and we demonstrated how explainable AI methods can aid in investigating the manifestation of bias in the model using this framework. Developing fair AI models is a considerable challenge given that many and often unknown sources of biases can be present in medical imaging datasets. In this work, we present a novel methodology to objectively study the impact of biases and mitigation strategies on deep learning pipelines, which can support the development of clinical AI that is robust and responsible.

7/2/2024

Reducing Biases towards Minoritized Populations in Medical Curricular Content via Artificial Intelligence for Fairer Health Outcomes

Chiman Salavati, Shannon Song, Willmar Sosa Diaz, Scott A. Hale, Roberto E. Montenegro, Fabricio Murai, Shiri Dori-Hacohen

Biased information (recently termed bisinformation) continues to be taught in medical curricula, often long after having been debunked. In this paper, we introduce BRICC, a firstin-class initiative that seeks to mitigate medical bisinformation using machine learning to systematically identify and flag text with potential biases, for subsequent review in an expert-in-the-loop fashion, thus greatly accelerating an otherwise labor-intensive process. A gold-standard BRICC dataset was developed throughout several years, and contains over 12K pages of instructional materials. Medical experts meticulously annotated these documents for bias according to comprehensive coding guidelines, emphasizing gender, sex, age, geography, ethnicity, and race. Using this labeled dataset, we trained, validated, and tested medical bias classifiers. We test three classifier approaches: a binary type-specific classifier, a general bias classifier; an ensemble combining bias type-specific classifiers independently-trained; and a multitask learning (MTL) model tasked with predicting both general and type-specific biases. While MTL led to some improvement on race bias detection in terms of F1-score, it did not outperform binary classifiers trained specifically on each task. On general bias detection, the binary classifier achieves up to 0.923 of AUC, a 27.8% improvement over the baseline. This work lays the foundations for debiasing medical curricula by exploring a novel dataset and evaluating different training model strategies. Hence, it offers new pathways for more nuanced and effective mitigation of bisinformation.

7/18/2024