A Survey on the Use of Partitioning in IoT-Edge-AI Applications

2406.00301

Published 6/4/2024 by Guoxing Yao, Lav Gupta

A Survey on the Use of Partitioning in IoT-Edge-AI Applications

Abstract

Centralized clouds processing the large amount of data generated by Internet-of-Things (IoT) can lead to unacceptable latencies for the end user. Against this backdrop, Edge Computing (EC) is an emerging paradigm that can address the shortcomings of traditional centralized Cloud Computing (CC). Its use is associated with improved performance, productivity, and security. Some of its use cases include smart grids, healthcare Augmented Reality (AR)/Virtual Reality (VR). EC uses servers strategically placed near end users, reducing latency and proving to be particularly well-suited for time-sensitive IoT applications. It is expected to play a pivotal role in 6G and Industry 5.0. Within the IoT-edge environment, artificial intelligence (AI) plays an important role in automating decision and control, including but not limited to resource allocation activities, drawing inferences from large volumes of data, and enabling powerful security mechanisms. The use cases in the IoT-Edge-cloud environment tend to be complex resulting in large AI models, big datasets, and complex computations. This has led to researchers proposing techniques that partition data, tasks, models, or hybrid to achieve speed, efficiency, and accuracy of processing. This survey comprehensively explores the IoT-Edge-AI environment, application cases, and the partitioning techniques used. We categorize partitioning techniques and compare their performance. The survey concludes by identifying open research challenges in this domain.

Create account to get full access

Overview

This paper provides a comprehensive survey on the use of partitioning techniques in IoT-Edge-AI applications.
Partitioning refers to the process of dividing data, models, or tasks across different computing resources (e.g., IoT devices, edge servers, cloud) to optimize performance, energy efficiency, and cost.
The survey examines various partitioning approaches, including data partitioning, model partitioning, and task partitioning, and their impact on IoT-Edge-AI applications.

Plain English Explanation

The paper discusses the concept of partitioning, which involves dividing up data, machine learning models, or computational tasks across different devices and servers in an IoT (Internet of Things) and edge computing system. This partitioning can help optimize the performance, energy efficiency, and cost of these AI-powered IoT applications.

For example, in an IoT system that uses edge computing, sensor data could be processed locally on the IoT devices themselves (data partitioning), while more complex machine learning models could be split and run partially on the edge servers and partially in the cloud (model partitioning). The tasks of collecting data, preprocessing it, running models, and delivering insights could also be divided up across the different devices and servers (task partitioning).

By carefully partitioning the workload, the system can take advantage of the strengths of each computing resource - the low latency and energy efficiency of the IoT devices, the increased processing power of the edge servers, and the virtually unlimited storage and computation of the cloud. This can lead to more responsive, efficient, and cost-effective IoT-Edge-AI applications.

Technical Explanation

The paper examines various partitioning approaches and their impact on IoT-Edge-AI applications:

[object Object]: This involves dividing up the sensor data collected by IoT devices and processing it across the IoT, edge, and cloud layers. For example, simple data processing tasks could be performed on the IoT devices, while more complex analysis is done on the edge servers or in the cloud.

[object Object]: Machine learning models can be split across the IoT, edge, and cloud, with different components of the model running on different devices. This allows large, complex models to be used in resource-constrained IoT environments.

[object Object]: The overall computational tasks, such as data collection, preprocessing, model inference, and result delivery, can be divided up and assigned to different devices based on their capabilities. This can improve the overall efficiency and responsiveness of the system.

The paper provides a comprehensive review of the various partitioning techniques, their trade-offs, and their impact on performance, energy efficiency, and cost in IoT-Edge-AI applications. It also discusses the challenges and open research questions in this area.

Critical Analysis

The paper provides a thorough and well-researched survey of partitioning techniques in IoT-Edge-AI applications. It acknowledges the potential limitations of the current approaches, such as the need for advanced coordination and management mechanisms to effectively partition the workload across the different computing resources.

One area that could be explored further is the impact of dynamic and adaptive partitioning, where the partitioning strategy is adjusted in real-time based on factors like changing resource availability, workload, or application requirements. This could help improve the overall resilience and responsiveness of the system.

Additionally, the paper does not delve deep into the security and privacy implications of partitioning sensitive data and models across multiple devices and servers. This is an important consideration, especially in critical IoT applications, and should be explored further in future research.

Conclusion

This survey paper provides a comprehensive overview of the use of partitioning techniques in IoT-Edge-AI applications. It highlights the potential benefits of data, model, and task partitioning in terms of performance, energy efficiency, and cost optimization. The insights from this paper can help guide the development of more efficient and effective IoT systems that leverage the combined power of IoT devices, edge computing, and cloud resources.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

❗

The Future of Consumer Edge-AI Computing

Stefanos Laskaridis, Stylianos I. Venieris, Alexandros Kouris, Rui Li, Nicholas D. Lane

In the last decade, Deep Learning has rapidly infiltrated the consumer end, mainly thanks to hardware acceleration across devices. However, as we look towards the future, it is evident that isolated hardware will be insufficient. Increasingly complex AI tasks demand shared resources, cross-device collaboration, and multiple data types, all without compromising user privacy or quality of experience. To address this, we introduce a novel paradigm centered around EdgeAI-Hub devices, designed to reorganise and optimise compute resources and data access at the consumer edge. To this end, we lay a holistic foundation for the transition from on-device to Edge-AI serving systems in consumer environments, detailing their components, structure, challenges and opportunities.

6/19/2024

cs.LG

👨‍🏫

Integrated Sensing-Communication-Computation for Edge Artificial Intelligence

Dingzhu Wen, Xiaoyang Li, Yong Zhou, Yuanming Shi, Sheng Wu, Chunxiao Jiang

Edge artificial intelligence (AI) has been a promising solution towards 6G to empower a series of advanced techniques such as digital twins, holographic projection, semantic communications, and auto-driving, for achieving intelligence of everything. The performance of edge AI tasks, including edge learning and edge AI inference, depends on the quality of three highly coupled processes, i.e., sensing for data acquisition, computation for information extraction, and communication for information transmission. However, these three modules need to compete for network resources for enhancing their own quality-of-services. To this end, integrated sensing-communication-computation (ISCC) is of paramount significance for improving resource utilization as well as achieving the customized goals of edge AI tasks. By investigating the interplay among the three modules, this article presents various kinds of ISCC schemes for federated edge learning tasks and edge AI inference tasks in both application and physical layers.

4/19/2024

cs.IT cs.AI cs.LG

Implementation of Big AI Models for Wireless Networks with Collaborative Edge Computing

Liekang Zeng, Shengyuan Ye, Xu Chen, Yang Yang

Big Artificial Intelligence (AI) models have emerged as a crucial element in various intelligent applications at the edge, such as voice assistants in smart homes and autonomous robotics in smart factories. Training big AI models, e.g., for personalized fine-tuning and continual model refinement, poses significant challenges to edge devices due to the inherent conflict between limited computing resources and intensive workload associated with training. Despite the constraints of on-device training, traditional approaches usually resort to aggregating training data and sending it to a remote cloud for centralized training. Nevertheless, this approach is neither sustainable, which strains long-range backhaul transmission and energy-consuming datacenters, nor safely private, which shares users' raw data with remote infrastructures. To address these challenges, we alternatively observe that prevalent edge environments usually contain a diverse collection of trusted edge devices with untapped idle resources, which can be leveraged for edge training acceleration. Motivated by this, in this article, we propose collaborative edge training, a novel training mechanism that orchestrates a group of trusted edge devices as a resource pool for expedited, sustainable big AI model training at the edge. As an initial step, we present a comprehensive framework for building collaborative edge training systems and analyze in-depth its merits and sustainable scheduling choices following its workflow. To further investigate the impact of its parallelism design, we empirically study a case of four typical parallelisms from the perspective of energy demand with realistic testbeds. Finally, we discuss open challenges for sustainable collaborative edge training to point to future directions of edge-centric big AI model training.

4/30/2024

cs.LG cs.AI cs.DC cs.NI

Urgent Edge Computing

Patrizio Dazzi, Luca Ferrucci, Marco Danelutto, Konstantinos Tserpes, Antonis Makris, Theodoros Theodoropoulos, Jacopo Massa, Emanuele Carlini, Matteo Mordacchini

This position paper introduces Urgent Edge Computing (UEC) as a paradigm shift addressing the evolving demands of time-sensitive applications in distributed edge environments, in time-critical scenarios. With a focus on ultra-low latency, availability, resource management, decentralization, self-organization, and robust security, UEC aims to facilitate operations in critical scenarios such as disaster response, environmental monitoring, and smart city management. This paper outlines and discusses the key requirements, challenges, and enablers along with a conceptual architecture. The paper also outlines the potential applications of Urgent Edge Computing

4/23/2024

cs.NI cs.DC