FNCC: Fast Notification Congestion Control in Data Center Networks

arXiv:2405.07608

Published 5/28/2024 by Jing Xu, Zhan Wang, Fan Yang, Ning Kang, Zhenlong Ma, Guojun Yuan, Guangming Tan, Ninghui Sun

Abstract

Congestion control plays a pivotal role in large-scale data centers, facilitating ultra-low latency, high bandwidth, and optimal utilization. Even with the deployment of data center congestion control mechanisms such as DCQCN and HPCC, these algorithms often respond to congestion sluggishly. This sluggishness is primarily due to the slow notification of congestion. It takes almost one round-trip time (RTT) for the congestion information to reach the sender. In this paper, we introduce the Fast Notification Congestion Control (FNCC) mechanism, which achieves sub-RTT notification. FNCC leverages the acknowledgment packet (ACK) from the return path to carry in-network telemetry (INT) information of the request path, offering the sender more timely and accurate INT. To further accelerate the responsiveness of last-hop congestion control, we propose that the receiver notifies the sender of the number of concurrent congested flows, which can be used to adjust the congested flows to a fair rate quickly. Our experimental results demonstrate that FNCC reduces flow completion time by 27.4% and 88.9% compared to HPCC and DCQCN, respectively. Moreover, FNCC triggers minimal pause frames and maintains high utilization even at 400Gbps.

Overview

The paper introduces a new congestion control mechanism called Fast Notification Congestion Control (FNCC) to address the sluggish response of existing data center congestion control algorithms.
FNCC leverages in-network telemetry (INT) information carried in acknowledgment (ACK) packets to provide the sender with more timely and accurate congestion information.
The paper also proposes a receiver-based mechanism to further improve the responsiveness of last-hop congestion control.
Experimental results show that FNCC outperforms existing algorithms in reducing flow completion time and maintaining high utilization.

Plain English Explanation

Data centers play a crucial role in our modern, digital world, handling massive amounts of data and powering various online services. DCQCN and HPCC are two congestion control algorithms deployed in these data centers to help deliver low latency, high bandwidth, and efficient resource utilization.

However, even with these algorithms in place, there is often a delay in detecting and responding to network congestion. The delay arises because congestion information takes almost one round-trip time (RTT) to reach the sender.

The researchers in this paper introduce a new approach called Fast Notification Congestion Control (FNCC), which delivers congestion information to the sender in less than one RTT. FNCC uses the acknowledgment (ACK) packets sent back from the receiver to carry information about the network conditions along the request path. This gives the sender a more accurate and timely view of the network state, so it can adjust its sending rate more quickly.

Additionally, the paper proposes a mechanism where the receiver notifies the sender of the number of concurrent congested flows. This information can be used by the sender to quickly adjust the rates of the congested flows to a fair level.

The researchers' experiments show that FNCC can reduce flow completion time by 27.4% and 88.9% compared to the HPCC and DCQCN algorithms, respectively. FNCC also maintains high network utilization, even at very high link speeds of 400 Gbps.

Technical Explanation

The paper introduces the Fast Notification Congestion Control (FNCC) mechanism to address the sluggish response of existing data center congestion control algorithms, such as DCQCN and HPCC. The key issue with these algorithms is that they take almost one round-trip time (RTT) to receive the congestion information from the network, delaying their ability to respond effectively.
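To make the "almost one RTT" claim concrete, here is a minimal sketch of the two notification paths. It is a simplified model, not the paper's analysis: it assumes symmetric request and return paths, fixed per-link delays (no queueing), and an ACK that happens to pass the congested switch right when congestion appears. The function names and the 1 µs link delays are hypothetical.

```python
from typing import List


def delay_via_receiver(link_delays: List[float], links_before_switch: int) -> float:
    """Congestion info rides a data packet to the receiver, then an ACK back.

    link_delays lists one-way link delays in sender -> receiver order.
    links_before_switch is the number of links between the sender and the
    congested switch.
    """
    switch_to_receiver = sum(link_delays[links_before_switch:])
    receiver_to_sender = sum(link_delays)  # symmetric return path (assumption)
    return switch_to_receiver + receiver_to_sender


def delay_via_return_path(link_delays: List[float], links_before_switch: int) -> float:
    """Congestion info is attached to an ACK already passing the switch."""
    return sum(link_delays[:links_before_switch])


links = [1.0, 1.0, 1.0, 1.0]   # hypothetical per-link delays in microseconds
rtt = 2 * sum(links)           # 8.0 us in this toy topology

for k in (1, 2, 3):
    echoed = delay_via_receiver(links, k)
    direct = delay_via_return_path(links, k)
    print(f"switch after {k} link(s): echoed={echoed} us, direct={direct} us, rtt={rtt} us")
```

In this toy model, congestion at the first switch takes 7 of the 8 µs of the RTT to reach the sender when echoed by the receiver, but only 1 µs when the information is picked up on the return path.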

FNCC leverages the acknowledgment (ACK) packets sent from the receiver to the sender to carry in-network telemetry (INT) information about the request path. This allows the sender to receive more timely and accurate congestion data, enabling a faster response.
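The idea can be sketched as a switch that, while forwarding an ACK, appends telemetry for the egress port that the same flow's data packets use in the request direction. This is an illustrative model only: the class names, the flow-to-port lookup table, and the exact telemetry fields (queue depth, transmitted bytes, timestamp, link speed) are assumptions and are not taken from the paper.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass(frozen=True)
class IntRecord:
    """One hop's telemetry for the request-direction egress port (illustrative fields)."""
    switch_id: str
    queue_bytes: int
    tx_bytes: int
    timestamp_ns: int
    link_gbps: float


@dataclass
class Ack:
    flow_id: int
    int_records: List[IntRecord] = field(default_factory=list)


@dataclass
class Switch:
    switch_id: str
    link_gbps: float
    # Hypothetical table: flow id -> egress port used by that flow's data packets.
    request_port: Dict[int, int]
    # Current queue depth and transmitted-byte counter per egress port.
    queue_bytes: Dict[int, int]
    tx_bytes: Dict[int, int]

    def forward_ack(self, ack: Ack, now_ns: int) -> Ack:
        """Stamp request-path telemetry onto an ACK travelling the return path."""
        port = self.request_port[ack.flow_id]
        ack.int_records.append(
            IntRecord(self.switch_id, self.queue_bytes[port],
                      self.tx_bytes[port], now_ns, self.link_gbps)
        )
        return ack


# The ACK meets the receiver-side switch first, then the sender-side switch.
near_receiver = Switch("s2", 400.0, {7: 1}, {1: 120_000}, {1: 9_000_000})
near_sender = Switch("s1", 400.0, {7: 3}, {3: 0}, {3: 9_500_000})

ack = Ack(flow_id=7)
ack = near_receiver.forward_ack(ack, now_ns=1_000)
ack = near_sender.forward_ack(ack, now_ns=1_800)
print([(r.switch_id, r.queue_bytes) for r in ack.int_records])
```

Because each record is sampled when the ACK passes the switch, the sender sees queue state that is at most one switch-to-sender trip old, rather than state that has first travelled to the receiver.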

To further improve the responsiveness of last-hop congestion control, the paper also proposes a receiver-based mechanism. The receiver notifies the sender of the number of concurrent congested flows, which the sender can then use to quickly adjust the rates of the congested flows to a fair level.
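A rough sketch of how such a flow count could be used follows. The summary says only that the count helps move congested flows to a fair rate quickly; the equal split of last-hop bandwidth shown here, along with the `Receiver` class and `fair_rate_gbps` helper, is an assumption made for illustration.

```python
from typing import Set


class Receiver:
    """Tracks which flows are currently sending to this host (hypothetical helper)."""

    def __init__(self) -> None:
        self.active_flows: Set[int] = set()

    def on_data(self, flow_id: int) -> int:
        """Record the flow and return the count to echo back to the sender."""
        self.active_flows.add(flow_id)
        return len(self.active_flows)

    def on_flow_end(self, flow_id: int) -> None:
        self.active_flows.discard(flow_id)


def fair_rate_gbps(last_hop_gbps: float, concurrent_flows: int) -> float:
    """Equal share of the last-hop link (assumed fairness policy)."""
    if concurrent_flows < 1:
        raise ValueError("concurrent_flows must be at least 1")
    return last_hop_gbps / concurrent_flows


receiver = Receiver()
count = 0
for flow_id in range(8):          # an 8-to-1 incast onto one receiver
    count = receiver.on_data(flow_id)

print(fair_rate_gbps(400.0, count))
```

With the count in hand, a sender can jump to its share in one step instead of converging over several feedback rounds.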

The researchers conducted experiments to evaluate the performance of FNCC compared to HPCC and DCQCN. The results show that FNCC reduces flow completion time by 27.4% and 88.9%, respectively, while also maintaining high network utilization even at 400Gbps speeds.
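As a reading aid, a percentage reduction in flow completion time can be converted into an equivalent speedup factor. The helper below is plain arithmetic on the reported percentages; the function name is hypothetical, and the summary does not say whether the figures refer to average or tail completion times.

```python
def speedup_from_reduction(reduction_pct: float) -> float:
    """Convert 'completion time reduced by X%' into 'X times faster'."""
    if not 0.0 <= reduction_pct < 100.0:
        raise ValueError("reduction_pct must be in [0, 100)")
    return 1.0 / (1.0 - reduction_pct / 100.0)


print(round(speedup_from_reduction(27.4), 2))  # versus HPCC
print(round(speedup_from_reduction(88.9), 2))  # versus DCQCN
```

A 27.4% reduction corresponds to finishing flows roughly 1.4 times faster, while an 88.9% reduction corresponds to roughly 9 times faster.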

Critical Analysis

The paper presents a novel approach to data center congestion control that addresses the key limitation of slow congestion notification in existing algorithms. By leveraging the acknowledgment path to carry more timely and accurate network telemetry information, FNCC can respond to congestion much faster than previous methods.

However, the paper does not delve into the potential overhead or complexity introduced by the additional functionality in the ACK packets. There may be concerns about the scalability of this approach, especially in high-throughput, low-latency environments where the volume of ACK traffic could become significant.
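The overhead concern can be framed with a back-of-the-envelope estimate of how much per-hop telemetry enlarges each ACK. Every number below is a placeholder chosen for illustration, not a measurement or a value from the paper, and the function name is hypothetical.

```python
from typing import Tuple


def ack_int_overhead(hops: int, bytes_per_hop: int, base_ack_bytes: int) -> Tuple[int, float]:
    """Return (extra bytes per ACK, growth relative to a telemetry-free ACK)."""
    if hops < 0 or bytes_per_hop < 0 or base_ack_bytes <= 0:
        raise ValueError("invalid sizes")
    extra = hops * bytes_per_hop
    return extra, extra / base_ack_bytes


# Placeholder inputs: 5 hops, 8 bytes of telemetry per hop, 64-byte bare ACK.
extra_bytes, growth = ack_int_overhead(hops=5, bytes_per_hop=8, base_ack_bytes=64)
print(extra_bytes, growth)
```

Plugging in real header sizes and ACK rates for a given deployment would show whether the added bytes are material on the return path.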

Additionally, the paper does not provide a detailed analysis of the fairness and stability properties of FNCC. While the experiments demonstrate performance improvements, further research may be needed to understand how FNCC behaves under more diverse network conditions and traffic patterns.

Overall, the FNCC mechanism presents a promising direction for improving data center congestion control, but the authors may need to address potential scalability and robustness concerns in future work.

Conclusion

The Fast Notification Congestion Control (FNCC) mechanism introduced in this paper represents a significant advancement in data center congestion control. By leveraging the acknowledgment path to provide the sender with more timely and accurate network telemetry information, FNCC can respond to congestion much more quickly than existing algorithms.

The experimental results demonstrate that FNCC can substantially reduce flow completion time and maintain high network utilization, even at very high data transfer speeds. This improvement in performance could have far-reaching implications for the development of highly efficient and responsive data center networks, enabling the delivery of low-latency, high-bandwidth services at scale.

While the paper identifies some potential areas for further research, the FNCC approach shows promise as a valuable contribution to the field of data center congestion control.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

Benjamin Fuhrer, Yuval Shpigelman, Chen Tessler, Shie Mannor, Gal Chechik, Eitan Zahavi, Gal Dalal

As communication protocols evolve, datacenter network utilization increases. As a result, congestion is more frequent, causing higher latency and packet loss. Combined with the increasing complexity of workloads, manual design of congestion control (CC) algorithms becomes extremely difficult. This calls for the development of AI approaches to replace the human effort. Unfortunately, it is currently not possible to deploy AI models on network devices due to their limited computational capabilities. Here, we offer a solution to this problem by building a computationally-light solution based on a recent reinforcement learning CC algorithm [arXiv:2207.02295]. We reduce the inference time of RL-CC by 500x by distilling its complex neural network into decision trees. This transformation enables real-time inference within the $\mu$-sec decision-time requirement, with a negligible effect on quality. We deploy the transformed policy on NVIDIA NICs in a live cluster. Compared to popular CC algorithms used in production, RL-CC is the only method that performs well on all benchmarks tested over a large range of number of flows. It balances multiple metrics simultaneously: bandwidth, latency, and packet drops. These results suggest that data-driven methods for CC are feasible, challenging the prior belief that handcrafted heuristics are necessary to achieve optimal performance.

6/4/2024

cs.NI cs.AI cs.LG

PET: Multi-agent Independent PPO-based Automatic ECN Tuning for High-Speed Data Center Networks

Kai Cheng, Ting Wang, Xiao Du, Shuyi Du, Haibin Cai

Explicit Congestion Notification (ECN)-based congestion control schemes have been widely adopted in high-speed data center networks (DCNs), where the ECN marking threshold plays a determinant role in guaranteeing a packet lossless DCN. However, existing approaches either employ static settings with immutable thresholds that cannot be dynamically self-adjusted to adapt to network dynamics, or fail to take into account many-to-one traffic patterns and different requirements of different types of traffic, resulting in relatively poor performance. To address these problems, this paper proposes a novel learning-based automatic ECN tuning scheme, named PET, based on the multi-agent Independent Proximal Policy Optimization (IPPO) algorithm. PET dynamically adjusts ECN thresholds by fully considering pivotal congestion-contributing factors, including queue length, output data rate, output rate of ECN-marked packets, current ECN threshold, the extent of incast, and the ratio of mice and elephant flows. PET adopts the Decentralized Training and Decentralized Execution (DTDE) paradigm and combines offline and online training to accommodate network dynamics. PET is also fair and readily deployable with commodity hardware. Comprehensive experimental results demonstrate that, compared with state-of-the-art static schemes and the learning-based automatic scheme, our PET achieves better performance in terms of flow completion time, convergence rate, queue length variance, and system robustness.

5/21/2024

cs.NI

Hercules: Heterogeneous Requirements Congestion Control Protocol

Neta Rozen-Schiff, Itzcak Pechtalt, Amit Navon, Leon Bruckman

Future network services present a significant challenge for network providers due to the high number and variety of co-existing requirements. Despite many advancements in network architectures and management schemes, congested network links continue to constrain the Quality of Service (QoS) for critical applications like tele-surgery and autonomous driving. A prominent, complementary approach consists of congestion control (CC) protocols which regulate bandwidth at the endpoints before network congestion occurs. However, existing CC protocols, including recent ones, are primarily designed to handle small numbers of requirement classes, highlighting the need for a more granular and flexible congestion control solution. In this paper we introduce Hercules, a novel CC protocol designed to handle heterogeneous requirements. Hercules is based on an online learning approach and has the capability to support any combination of requirements within an unbounded and continuous requirements space. We have implemented Hercules as a QUIC module and demonstrate, through extensive analysis and real-world experiments, that Hercules can achieve up to 3.5-fold improvement in QoS compared to state-of-the-art CC protocols.

6/6/2024

cs.NI

Flow Optimization at Inter-Datacenter Networks for Application Run-time Acceleration

Berta Serracanta, Alberto Rodriguez-Natal, Fabio Maino, Albert Cabellos

At present, distributed applications are commonly spread across multiple datacenters, reaching out to edge and fog computing locations. The transition away from single datacenter hosting is driven by capacity constraints in datacenters and the adoption of hybrid deployment strategies, combining on-premise and public cloud facilities. However, the performance of such applications is often limited by extended Flow Completion Times (FCT) for short flows due to queuing behind bursts of packets from concurrent long flows. To address this challenge, we propose a solution to prioritize short flows over long flows in the Software-Defined Wide-Area Network (SD-WAN) interconnecting the distributed computing platforms. Our solution utilizes eBPF to segregate short and long flows, transmitting them over separate tunnels with the same properties. By effectively mitigating queuing delays, we consistently achieve a 1.5 times reduction in FCT for short flows, resulting in improved application response times. The proposed solution works with encrypted traffic and is application-agnostic, making it deployable in diverse distributed environments without modifying the applications themselves. Our testbed evaluation demonstrates the effectiveness of our approach in accelerating the run-time of distributed applications, providing valuable insights for optimizing multi-datacenter and edge deployments.

6/19/2024

cs.NI