FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling

Read original: arXiv:2310.02492 - Published 4/15/2024 by Yan Luo, Muhammad Osama Khan, Yu Tian, Min Shi, Zehao Dou, Tobias Elze, Yi Fang, Mengyu Wang

Overview

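This paper introduces Harvard-FairVision, which this summary refers to as the Harvard Eye Fairness (HEF) dataset: a large-scale collection of 2D and 3D eye imaging from 30,000 subjects, built to study equity in eye disease screening. It also proposes fair identity scaling (FIS), a method for reducing bias in screening models.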
The dataset aims to enable more equitable and inclusive computer vision models for eye health applications, which is important given the known biases and disparities in existing eye imaging datasets.
The paper analyzes the dataset's demographics, diversity, and potential uses for training fair and robust eye disease screening models.

Plain English Explanation

The researchers created a new dataset called Harvard Eye Fairness (HEF) that contains thousands of 3D eye images from a diverse range of people. This is important because many existing eye imaging datasets have biases and don't include enough data from underrepresented groups.

The goal of the HEF dataset is to help develop computer vision models for eye disease screening that are more fair and accurate for all people, regardless of their background. This is crucial, as biased models could lead to missed diagnoses or improper treatments for certain populations.

By analyzing the HEF dataset, the researchers found it is much more diverse and representative than previous eye imaging datasets. This gives researchers and companies building eye health technology a better resource to train their models to be equitable and inclusive.

The paper also shows how the HEF dataset can be used to evaluate the fairness of computer vision models, and it proposes fair identity scaling (FIS), a training method that the authors report improves both overall performance and fairness. This is an important consideration as these technologies become more widely deployed.

Technical Explanation

The paper introduces the Harvard Eye Fairness (HEF) dataset, released by the authors as Harvard-FairVision, a large-scale eye imaging dataset designed to enable more equitable and inclusive computer vision models for eye health applications. According to the paper's abstract, the dataset covers 30,000 subjects with both 2D and 3D imaging data, six demographic identity attributes, and labels for three major eye disorders.

The authors analyze the dataset's composition and diversity, arguing that it is more representative than prior eye imaging datasets, which have been found to carry substantial biases. They also demonstrate how the HEF dataset can be used to evaluate the fairness of eye disease screening models across protected attributes such as race, gender, and ethnicity.
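To make the idea of evaluating fairness across groups concrete, here is a minimal sketch in plain Python. It computes AUC overall and per demographic group, then reports the gap between the best and worst group. The `equity_scaled` value divides the overall AUC by one plus the summed deviation of each group from the overall score; that formula is an assumption borrowed from the equity-scaled metric idea named in the FairSeg abstract, not something this summary confirms for FairVision. All function names and the toy data are hypothetical.

```python
from collections import defaultdict


def auc(labels, scores):
    """Area under the ROC curve by pairwise comparison (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def groupwise_auc(labels, scores, groups):
    """AUC computed separately inside each demographic group."""
    by_group = defaultdict(lambda: ([], []))
    for y, s, g in zip(labels, scores, groups):
        by_group[g][0].append(y)
        by_group[g][1].append(s)
    return {g: auc(ys, ss) for g, (ys, ss) in by_group.items()}


def fairness_report(labels, scores, groups):
    """Overall AUC, per-group AUC, best-worst gap, and an equity-scaled AUC."""
    overall = auc(labels, scores)
    per_group = groupwise_auc(labels, scores, groups)
    gap = max(per_group.values()) - min(per_group.values())
    # Assumed formulation: penalize overall AUC by total group deviation.
    deviation = sum(abs(overall - v) for v in per_group.values())
    return {
        "overall": overall,
        "per_group": per_group,
        "gap": gap,
        "equity_scaled": overall / (1.0 + deviation),
    }


# Toy data: the model ranks group A perfectly and group B at chance level.
labels = [1, 0, 1, 0, 1, 0, 1, 0]
scores = [0.9, 0.1, 0.8, 0.2, 0.6, 0.7, 0.4, 0.3]
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]

report = fairness_report(labels, scores, groups)
print(report)
```

In the toy data the overall AUC looks respectable while one group sits at chance, which is the kind of disparity a single aggregate score hides.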

Experiments on the HEF dataset, spanning 2D and 3D models, five architectures, and three common eye diseases, reveal significant disparities in performance across race, gender, and ethnicity. To reduce them, the authors propose fair identity scaling (FIS), which they report improves both overall performance and fairness relative to existing fairness methods.
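This summary does not spell out how the paper's training technique works, so the sketch below is not the authors' method. It only illustrates one generic way demographic metadata can enter training: re-weighting each sample's loss by how badly its group is currently doing. The softmax weighting, the `temperature` parameter, and all function names are assumptions made for illustration.

```python
import math
from collections import defaultdict


def bce(y, p, eps=1e-7):
    """Binary cross-entropy for one sample, with p clamped away from 0 and 1."""
    p = min(max(p, eps), 1.0 - eps)
    return -(y * math.log(p) + (1 - y) * math.log(1.0 - p))


def group_weights(losses, groups, temperature=1.0):
    """Softmax over mean group loss, rescaled so the weights average to 1.

    Groups with a higher mean loss receive a weight above 1.
    """
    sums, counts = defaultdict(float), defaultdict(int)
    for loss, g in zip(losses, groups):
        sums[g] += loss
        counts[g] += 1
    means = {g: sums[g] / counts[g] for g in sums}
    z = sum(math.exp(m / temperature) for m in means.values())
    return {g: len(means) * math.exp(m / temperature) / z for g, m in means.items()}


def reweighted_loss(labels, probs, groups, temperature=1.0):
    """Mean per-sample loss after scaling each sample by its group's weight."""
    losses = [bce(y, p) for y, p in zip(labels, probs)]
    weights = group_weights(losses, groups, temperature)
    return sum(weights[g] * loss for loss, g in zip(losses, groups)) / len(losses)


# Toy batch: predictions are confident for group A and hesitant for group B.
labels = [1, 0, 1, 0]
probs = [0.9, 0.1, 0.6, 0.4]
groups = ["A", "A", "B", "B"]

per_sample = [bce(y, p) for y, p in zip(labels, probs)]
weights = group_weights(per_sample, groups)
plain = sum(per_sample) / len(per_sample)
weighted = reweighted_loss(labels, probs, groups)
print(weights, plain, weighted)
```

Because the weights grow with group loss, the re-weighted objective puts more gradient pressure on the group the model currently serves worst. In a real training loop the weights would typically be treated as constants when backpropagating.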

The paper also discusses the ethical considerations around the collection and use of sensitive identity data, and outlines steps taken to protect participant privacy while enabling fair and inclusive model development.

Critical Analysis

The HEF dataset represents an important contribution to addressing bias and inequity in computer vision for eye health applications. By providing a large-scale, diverse dataset, the researchers enable the development of more inclusive and fair models.

However, the dataset is still limited to the Boston metropolitan area, and may not fully capture the global diversity of eye morphologies and disease prevalence. Further research is needed to evaluate the generalizability of models trained on HEF to other populations and settings.

Additionally, while the dataset includes rich metadata on demographics and identity attributes, the authors acknowledge potential biases in how these are self-reported or measured. Ongoing work is needed to further validate the accuracy and representativeness of this data.

The paper also does not deeply explore the societal implications of deploying eye disease screening models at scale, and how to ensure these technologies benefit all communities equitably. Careful consideration of the downstream uses and potential harms is crucial as this work progresses.

Overall, the HEF dataset and the techniques developed in this paper represent an important step towards more fair and inclusive computer vision for eye health. Continued research, collaboration, and responsible deployment will be essential to realizing the full potential of this work.

Conclusion

The Harvard Eye Fairness (HEF) dataset introduced in this paper is a significant contribution to addressing bias and inequity in computer vision for eye health applications. By providing a large-scale, diverse dataset of 3D eye images, the researchers enable the development of more inclusive and fair disease screening models.

The detailed analysis of the dataset's demographics and diversity shows it is much more representative than previous eye imaging collections, which have been plagued by biases. This gives researchers and companies building eye health technologies a valuable resource to train equitable and robust models.

The paper also outlines methods for evaluating model fairness and proposes fair identity scaling (FIS) to reduce the biases it finds, key considerations as these technologies become more widely deployed. While limitations and ethical challenges remain, the HEF dataset represents an important step towards ensuring eye health technologies benefit all communities equitably.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling

Yan Luo, Muhammad Osama Khan, Yu Tian, Min Shi, Zehao Dou, Tobias Elze, Yi Fang, Mengyu Wang

Equity in AI for healthcare is crucial due to its direct impact on human well-being. Despite advancements in 2D medical imaging fairness, the fairness of 3D models remains underexplored, hindered by the small sizes of 3D fairness datasets. Since 3D imaging surpasses 2D imaging in SOTA clinical care, it is critical to understand the fairness of these 3D models. To address this research gap, we conduct the first comprehensive study on the fairness of 3D medical imaging models across multiple protected attributes. Our investigation spans both 2D and 3D models and evaluates fairness across five architectures on three common eye diseases, revealing significant biases across race, gender, and ethnicity. To alleviate these biases, we propose a novel fair identity scaling (FIS) method that improves both overall performance and fairness, outperforming various SOTA fairness methods. Moreover, we release Harvard-FairVision, the first large-scale medical fairness dataset with 30,000 subjects featuring both 2D and 3D imaging data and six demographic identity attributes. Harvard-FairVision provides labels for three major eye disorders affecting about 380 million people worldwide, serving as a valuable resource for both 2D and 3D fairness learning. Our code and dataset are publicly accessible at https://ophai.hms.harvard.edu/datasets/harvard-fairvision30k.

4/15/2024

Toward Fairer Face Recognition Datasets

Alexandre Fournier-Mongieux, Michael Soumm, Adrian Popescu, Bertrand Luvison, Hervé Le Borgne

Face recognition and verification are two computer vision tasks whose performance has progressed with the introduction of deep representations. However, ethical, legal, and technical challenges due to the sensitive character of face data and biases in real training datasets hinder their development. Generative AI addresses privacy by creating fictitious identities, but fairness problems persist. We promote fairness by introducing a demographic attributes balancing mechanism in generated training datasets. We experiment with an existing real dataset, three generated training datasets, and the balanced versions of a diffusion-based dataset. We propose a comprehensive evaluation that considers accuracy and fairness equally and includes a rigorous regression-based statistical analysis of attributes. The analysis shows that balancing reduces demographic unfairness. Also, a performance gap persists despite generation becoming more accurate with time. The proposed balancing method and comprehensive verification evaluation promote fairer and transparent face recognition and verification.

6/26/2024

FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

Yu Tian, Min Shi, Yan Luo, Ava Kouhana, Tobias Elze, Mengyu Wang

Fairness in artificial intelligence models has gained significantly more attention in recent years, especially in the area of medicine, as fairness in medical models is critical to people's well-being and lives. High-quality medical fairness datasets are needed to promote fairness learning research. Existing medical fairness datasets are all for classification tasks, and no fairness datasets are available for medical segmentation, while medical segmentation is an equally important clinical task as classifications, which can provide detailed spatial information on organ abnormalities ready to be assessed by clinicians. In this paper, we propose the first fairness dataset for medical segmentation named Harvard-FairSeg with 10,000 subject samples. In addition, we propose a fair error-bound scaling approach to reweight the loss function with the upper error-bound in each identity group, using the segment anything model (SAM). We anticipate that the segmentation performance equity can be improved by explicitly tackling the hard cases with high training errors in each identity group. To facilitate fair comparisons, we utilize a novel equity-scaled segmentation performance metric to compare segmentation metrics in the context of fairness, such as the equity-scaled Dice coefficient. Through comprehensive experiments, we demonstrate that our fair error-bound scaling approach either has superior or comparable fairness performance to the state-of-the-art fairness learning models. The dataset and code are publicly accessible via https://ophai.hms.harvard.edu/datasets/harvard-fairseg10k.

5/2/2024

FairDomain: Achieving Fairness in Cross-Domain Medical Image Segmentation and Classification

Yu Tian, Congcong Wen, Min Shi, Muhammad Muneeb Afzal, Hao Huang, Muhammad Osama Khan, Yan Luo, Yi Fang, Mengyu Wang

Addressing fairness in artificial intelligence (AI), particularly in medical AI, is crucial for ensuring equitable healthcare outcomes. Recent efforts to enhance fairness have introduced new methodologies and datasets in medical AI. However, the fairness issue under the setting of domain transfer is almost unexplored, while it is common that clinics rely on different imaging technologies (e.g., different retinal imaging modalities) for patient diagnosis. This paper presents FairDomain, a pioneering systemic study into algorithmic fairness under domain shifts, employing state-of-the-art domain adaptation (DA) and generalization (DG) algorithms for both medical segmentation and classification tasks to understand how biases are transferred between different domains. We also introduce a novel plug-and-play fair identity attention (FIA) module that adapts to various DA and DG algorithms to improve fairness by using self-attention to adjust feature importance based on demographic attributes. Additionally, we curate the first fairness-focused dataset with two paired imaging modalities for the same patient cohort on medical segmentation and classification tasks, to rigorously assess fairness in domain-shift scenarios. Excluding the confounding impact of demographic distribution variation between source and target domains will allow clearer quantification of the performance of domain transfer models. Our extensive evaluations reveal that the proposed FIA significantly enhances both model performance accounted for fairness across all domain shift settings (i.e., DA and DG) with respect to different demographics, which outperforms existing methods on both segmentation and classification. The code and data can be accessed at https://ophai.hms.harvard.edu/datasets/harvard-fairdomain20k.

7/22/2024