Federated Learning with Incomplete Sensing Modalities

2405.11828

Published 5/21/2024 by Adiba Orzikulova, Jaehyun Kwak, Jaemin Shin, Sung-Ju Lee

Abstract

Many mobile sensing applications utilize data from various modalities, including motion and physiological sensors in mobile and wearable devices. Federated Learning (FL) is particularly suitable for these applications thanks to its privacy-preserving feature. However, challenges such as limited battery life, poor network conditions, and sensor malfunctions can restrict the use of all available modalities for local model training. Additionally, existing multimodal FL systems struggle with scalability and efficiency as the number of modality sources increases. To address these issues, we introduce FLISM, a framework designed to enable multimodal FL with incomplete modalities. FLISM leverages a simulation technique to learn robust representations that can handle missing modalities and transfers model knowledge across clients with varying sets of modalities. The evaluation results using three real-world datasets and simulations demonstrate FLISM's effective balance between model performance and system efficiency. It shows an average improvement of 0.067 in F1-score, while also reducing communication (2.69x faster) and computational (2.28x more efficient) overheads compared to existing methods addressing incomplete modalities. Moreover, in simulated scenarios involving tasks with a larger number of modalities, FLISM achieves a significant speedup of 3.23x~85.10x in communication and 3.73x~32.29x in computational efficiency.

Overview

Explores a federated learning approach to handle incomplete sensing modalities in multimodal systems
Addresses scalability and efficiency challenges in federated learning with multiple sensing modalities
Proposes techniques to mitigate the impact of missing data and improve overall system performance

Plain English Explanation

Federated learning is a way of training machine learning models without centralizing all the data. Instead, the data stays on individual devices, and the models are trained collaboratively. This is useful for privacy-sensitive applications like healthcare or finance, where centralized data storage may not be feasible or desirable.
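To make the idea of collaborative training concrete, here is a minimal sketch of sample-weighted federated averaging, a common aggregation rule in federated learning. It is not taken from the paper: the function name `fed_avg`, the flat-list weight format, and the toy numbers are all assumptions chosen for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical weight format: parameter name -> flat list of floats.
Weights = Dict[str, List[float]]


def fed_avg(updates: List[Tuple[Weights, int]]) -> Weights:
    """Average client weights, weighting each client by its local sample count."""
    total = sum(n for _, n in updates)
    if total <= 0:
        raise ValueError("need at least one client with data")

    averaged: Weights = {}
    for name, reference in updates[0][0].items():
        acc = [0.0] * len(reference)
        for weights, n in updates:
            for i, value in enumerate(weights[name]):
                acc[i] += value * n / total
        averaged[name] = acc
    return averaged


# Two clients train locally and send only weights, never raw data.
client_updates = [
    ({"layer": [1.0, 2.0]}, 10),  # client A: 10 local samples
    ({"layer": [3.0, 4.0]}, 30),  # client B: 30 local samples
]
global_weights = fed_avg(client_updates)
```

In this toy run, client B holds three times as much data as client A, so the averaged weights land closer to client B's values.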

In this paper, the researchers look at a specific challenge in federated learning: what happens when some of the "sensing modalities" (i.e., types of sensor data) are missing on certain devices? For example, imagine a system that uses both camera and microphone data to detect events. If some devices only have a camera and no microphone, that could cause problems for the federated learning process.
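The camera and microphone scenario can be sketched in code. The example below uses a generic technique, zero-filling absent modalities and randomly dropping present ones during training, so that a model sees incomplete inputs while it learns. This is only an assumption about how such simulation might look, not FLISM's actual procedure; `drop_modalities` and the modality names are hypothetical.

```python
import random
from typing import Dict, List, Set, Tuple

Sample = Dict[str, List[float]]  # modality name -> feature vector


def drop_modalities(
    sample: Sample, available: Set[str], drop_prob: float, rng: random.Random
) -> Tuple[Sample, Dict[str, bool]]:
    """Zero-fill modalities the device lacks and randomly drop some it has."""
    present = [m for m in sample if m in available]
    kept = [m for m in present if rng.random() >= drop_prob]
    if not kept and present:
        # Always keep at least one real modality so the input is not empty.
        kept = [rng.choice(present)]
    masked = {
        m: (list(v) if m in kept else [0.0] * len(v)) for m, v in sample.items()
    }
    mask = {m: m in kept for m in sample}
    return masked, mask


rng = random.Random(0)
sample = {"camera": [0.2, 0.9], "microphone": [0.5, 0.1]}

# A device with no microphone: that modality is always zero-filled.
camera_only, camera_only_mask = drop_modalities(sample, {"camera"}, 0.0, rng)

# A fully equipped device with aggressive dropout still keeps one modality.
dropped, dropped_mask = drop_modalities(sample, {"camera", "microphone"}, 1.0, rng)
```

The returned mask records which modalities carry real data, which a downstream model or aggregation step could use to ignore the zero-filled inputs.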

The researchers propose several techniques to address this issue of incomplete sensing modalities. They show how these techniques can improve the overall scalability and efficiency of the federated learning system, even when dealing with missing data. This is an important step towards making federated learning more practical for real-world applications with diverse and potentially incomplete sensor data.

Technical Explanation

The paper introduces FLISM, a federated learning framework for multimodal systems in which clients have incomplete sensing modalities. The researchers propose techniques to mitigate the impact of missing data and to improve the scalability and efficiency of the system.

Key aspects of the technical approach include:

Modality-aware parameter sharing: The model parameters are shared across devices, but with specific adaptations for each sensing modality to account for missing data.
Partial model aggregation: During the federated learning process, the server only aggregates the parameters corresponding to the available sensing modalities on each device.
Personalized model adaptation: Each device can further fine-tune the federated model to its local data and sensing capabilities, improving performance.
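The partial model aggregation idea listed above can be illustrated with a short sketch. It assumes each client uploads one parameter list per modality it actually has, and that the server takes an unweighted mean over the clients that reported each modality. The summary does not describe the paper's actual weighting, so the function `aggregate_by_modality` and the sensor names are hypothetical.

```python
from typing import Dict, List

# Hypothetical upload format: modality name -> that modality's encoder parameters.
ClientUpdate = Dict[str, List[float]]


def aggregate_by_modality(updates: List[ClientUpdate]) -> Dict[str, List[float]]:
    """Average each modality's parameters over only the clients that sent them."""
    sums: Dict[str, List[float]] = {}
    counts: Dict[str, int] = {}
    for update in updates:
        for modality, params in update.items():
            if modality not in sums:
                sums[modality] = [0.0] * len(params)
                counts[modality] = 0
            for i, value in enumerate(params):
                sums[modality][i] += value
            counts[modality] += 1
    return {m: [s / counts[m] for s in sums[m]] for m in sums}


updates = [
    {"accelerometer": [1.0, 1.0], "heart_rate": [4.0]},
    {"accelerometer": [3.0, 5.0]},  # this client's heart-rate sensor is missing
]
global_encoders = aggregate_by_modality(updates)
```

Here the accelerometer parameters are averaged over both clients, while the heart-rate parameters come from the single client that has that sensor, so a missing modality never pulls the average toward zero.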

According to the abstract, the evaluation uses three real-world datasets and simulations. Compared to existing methods that address incomplete modalities, FLISM improves F1-score by 0.067 on average, with communication 2.69x faster and computation 2.28x more efficient.

Critical Analysis

The paper presents a well-designed solution to an important challenge in federated learning for multimodal systems. The techniques proposed, such as modality-aware parameter sharing and partial model aggregation, are clever and effectively address the problem of incomplete sensing modalities.

However, the paper does not delve deeply into the potential privacy-preserving aspects of the federated learning approach. While federated learning is generally seen as a more privacy-friendly alternative to centralized machine learning, the researchers could have discussed how their specific techniques might impact privacy guarantees or introduce new trade-offs.

Additionally, although the abstract reports reduced communication and computational overheads, this summary does not indicate whether on-device energy use and end-to-end latency were measured directly, which would be an important consideration for real-world deployment, especially on resource-constrained edge devices.

Conclusion

This paper presents an innovative federated learning approach to handle incomplete sensing modalities in multimodal systems. By addressing the challenges of scalability and efficiency, the researchers have made an important contribution towards making federated learning more practical for real-world applications with diverse and potentially incomplete sensor data.

The proposed techniques, such as modality-aware parameter sharing and personalized model adaptation, show promising results and could pave the way for more robust and versatile federated learning systems. As the field of federated learning continues to evolve, addressing issues like privacy and resource constraints will be crucial for widespread adoption.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete Modality

Liwei Che, Jiaqi Wang, Xinyue Liu, Fenglong Ma

Federated learning (FL) has obtained tremendous progress in providing collaborative training solutions for distributed data silos with privacy guarantees. However, few existing works explore a more realistic scenario where the clients hold multiple data modalities. In this paper, we aim to solve a novel challenge in multi-modal federated learning (MFL) -- modality missing -- the clients may lose part of the modalities in their local data sets. To tackle the problems, we propose a novel multi-modal federated learning method, Federated Multi-modal contrastiVe training with Pre-trained completion (FedMVP), which integrates the large-scale pre-trained models to enhance the federated training. In the proposed FedMVP framework, each client deploys a large-scale pre-trained model with frozen parameters for modality completion and representation knowledge transfer, enabling efficient and robust local training. On the server side, we utilize generated data to uniformly measure the representation similarity among the uploaded client models and construct a graph perspective to aggregate them according to their importance in the system. We demonstrate that the model achieves superior performance over two real-world image-text classification datasets and is robust to the performance degradation caused by missing modality.

6/18/2024

cs.LG cs.DC

Towards Multi-modal Transformers in Federated Learning

Guangyu Sun, Matias Mendieta, Aritra Dutta, Xin Li, Chen Chen

Multi-modal transformers mark significant progress in different domains, but siloed high-quality data hinders their further improvement. To remedy this, federated learning (FL) has emerged as a promising privacy-preserving paradigm for training models without direct access to the raw data held by different clients. Despite its potential, a considerable research direction regarding the unpaired uni-modal clients and the transformer architecture in FL remains unexplored. To fill this gap, this paper explores a transfer multi-modal federated learning (MFL) scenario within the vision-language domain, where clients possess data of various modalities distributed across different datasets. We systematically evaluate the performance of existing methods when a transformer architecture is utilized and introduce a novel framework called Federated modality complementary and collaboration (FedCola) by addressing the in-modality and cross-modality gaps among clients. Through extensive experiments across various FL settings, FedCola demonstrates superior performance over previous approaches, offering new perspectives on future federated training of multi-modal transformers.

4/22/2024

cs.CV cs.LG

Semi-Federated Learning for Internet of Intelligence

Wanli Ni, Zhaohui Yang

One key vision of intelligent Internet of Things (IoT) is to provide connected intelligence for a large number of application scenarios, such as self-driving cars, industrial manufacturing, and smart city. However, existing centralized or federated learning paradigms have difficulties in coordinating heterogeneous resources in distributed IoT environments. In this article, we introduce a semi-federated learning (SemiFL) framework to tackle the challenges of data and device heterogeneity in massive IoT networks. In SemiFL, only users with sufficient computing resources are selected for local model training, while the remaining users only transmit raw data to the base station for remote computing. By doing so, SemiFL incorporates conventional centralized and federated learning paradigms into a harmonized framework that allows all devices to participate in the global model training regardless of their computational capabilities and data distributions. Furthermore, we propose a next-generation multiple access scheme by seamlessly integrating communication and computation over the air. This achieves the concurrent transmission of raw data and model parameters in a spectrum-efficient manner. With their abilities to change channels and charge devices, two emerging techniques, reconfigurable intelligent surface and wireless energy transfer, are merged with our SemiFL framework to enhance its performance in bandwidth- and energy-limited IoT networks, respectively. Simulation results are presented to demonstrate the superiority of our SemiFL for achieving edge intelligence among computing-heterogeneous IoT devices.

5/29/2024

cs.NI cs.SY eess.SY

Enhancing Efficiency in Multidevice Federated Learning through Data Selection

Fan Mo, Mohammad Malekzadeh, Soumyajit Chatterjee, Fahim Kawsar, Akhil Mathur

Federated learning (FL) in multidevice environments creates new opportunities to learn from a vast and diverse amount of private data. Although personal devices capture valuable data, their memory, computing, connectivity, and battery resources are often limited. Since deep neural networks (DNNs) are the typical machine learning models employed in FL, there are demands for integrating ubiquitous constrained devices into the training process of DNNs. In this paper, we develop an FL framework to incorporate on-device data selection on such constrained devices, which allows partition-based training of a DNN through collaboration between constrained devices and resourceful devices of the same client. Evaluations on five benchmark DNNs and six benchmark datasets across different modalities show that, on average, our framework achieves ~19% higher accuracy and ~58% lower latency compared to the baseline FL without our implemented strategies. We demonstrate the effectiveness of our FL framework when dealing with imbalanced data, client participation heterogeneity, and various mobility patterns. As a benchmark for the community, our code is available at https://github.com/dr-bell/data-centric-federated-learning

4/11/2024

cs.LG