Performance characterisation of the 64-core SG2042 RISC-V CPU for HPC

Read original: arXiv:2406.12394 - Published 6/19/2024 by Nick Brown, Maurice Jamieson

Overview

  • The paper characterises the performance of the 64-core SG2042 RISC-V CPU for high-performance computing (HPC) workloads.
  • It evaluates how the SG2042 performs and scales on HPC workloads relative to other CPUs implementing the RISC-V, x86-64, and AArch64 ISAs.
  • Experiments use NASA's NAS Parallel Benchmark (NPB) suite, a widely used benchmark for assessing the performance of HPC systems.

Plain English Explanation

The paper examines the performance of a new 64-core RISC-V-based CPU called the SG2042, which is designed for use in high-performance computing (HPC) systems. The researchers wanted to see how well this new CPU can handle demanding computational tasks that are common in HPC applications.

To test the CPU's performance, the researchers used a well-known benchmark called the NAS Parallel Benchmark (NPB). The NPB suite includes a variety of computational workloads that are representative of the kinds of tasks HPC systems are often used for, such as simulating fluid dynamics.

By running these benchmark tests on the SG2042 CPU, the researchers could evaluate how fast and efficient the CPU is at completing these HPC-related computations. This information is valuable for understanding the CPU's potential applications in real-world HPC systems, such as those used for weather forecasting or scientific simulations.

Technical Explanation

The paper characterises the performance of the 64-core SG2042 RISC-V CPU on HPC workloads using the NAS Parallel Benchmark (NPB) suite. The NPB suite consists of parallel kernels and pseudo-applications derived from computational fluid dynamics, and it includes computationally bound as well as memory bandwidth and latency bound algorithms.

The researchers ran the different NPB benchmark tests on the SG2042 system and measured various performance metrics, including execution time, instructions per cycle (IPC), and parallel efficiency. They also compared the SG2042's performance to that of other CPUs implementing the RISC-V, x86-64, and AArch64 ISAs, to provide context for interpreting the results.
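
As a rough illustration of how speedup and parallel efficiency are conventionally derived from measured execution times, the following minimal Python sketch may help. The function names and the timing values are hypothetical and chosen purely for illustration; they are not measurements from the paper.

```python
def speedup(t_single_core: float, t_parallel: float) -> float:
    """Speedup relative to the single-core run time."""
    return t_single_core / t_parallel


def parallel_efficiency(t_single_core: float, t_parallel: float, cores: int) -> float:
    """Fraction of ideal (linear) speedup achieved on `cores` cores."""
    return speedup(t_single_core, t_parallel) / cores


def scaling_table(timings: dict[int, float]) -> list[tuple[int, float, float]]:
    """Return (cores, speedup, efficiency) rows; `timings` must include a 1-core run."""
    t1 = timings[1]
    return [
        (cores, speedup(t1, t), parallel_efficiency(t1, t, cores))
        for cores, t in sorted(timings.items())
    ]


if __name__ == "__main__":
    # Hypothetical run times in seconds, NOT results from the paper.
    example_timings = {1: 120.0, 16: 10.0, 64: 5.0}
    for cores, s, e in scaling_table(example_timings):
        print(f"{cores:>2} cores: speedup {s:5.1f}x, efficiency {e:6.1%}")
```

An efficiency that falls as the core count rises often points to contention for a shared resource such as memory bandwidth, although other causes (load imbalance, synchronisation overhead) can produce the same pattern.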

The paper presents detailed performance analysis of the SG2042 across the different NPB benchmark tests, exploring how the CPU's performance scales with the number of active cores. The researchers also investigated the impact of various architectural features, such as cache sizes and memory bandwidth, on the CPU's ability to handle the HPC workloads efficiently.
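
One common way to reason about whether memory bandwidth limits a workload is the roofline model, in which attainable performance is the lesser of peak compute throughput and memory bandwidth multiplied by arithmetic intensity. The summary does not say the paper uses this model, so the sketch below is only an illustration, and the machine figures in it are hypothetical placeholders rather than SG2042 specifications.

```python
def attainable_gflops(peak_gflops: float, bandwidth_gb_s: float,
                      flops_per_byte: float) -> float:
    """Roofline bound: min(compute roof, bandwidth roof at this intensity)."""
    return min(peak_gflops, bandwidth_gb_s * flops_per_byte)


def is_memory_bound(peak_gflops: float, bandwidth_gb_s: float,
                    flops_per_byte: float) -> bool:
    """True when the bandwidth roof lies below the compute roof."""
    return bandwidth_gb_s * flops_per_byte < peak_gflops


if __name__ == "__main__":
    # Hypothetical machine: 100 GFLOP/s peak, 50 GB/s memory bandwidth.
    # These are placeholder numbers, NOT SG2042 figures.
    peak, bandwidth = 100.0, 50.0
    for intensity in (0.5, 2.0, 8.0):  # FLOPs per byte moved
        bound = attainable_gflops(peak, bandwidth, intensity)
        kind = "memory" if is_memory_bound(peak, bandwidth, intensity) else "compute"
        print(f"intensity {intensity:4.1f} FLOP/B -> {bound:6.1f} GFLOP/s ({kind} bound)")
```

Under this model, kernels with low arithmetic intensity are capped by the memory system regardless of core count, which is one way to picture why a CPU can do well on computationally bound algorithms and less well on bandwidth bound ones.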

Critical Analysis

The paper provides a thorough and systematic evaluation of the SG2042 RISC-V CPU's performance for HPC applications. The use of the well-established NPB benchmark suite is a strength, as it allows for direct comparisons to other HPC systems.

The paper itself identifies one significant limitation: the SG2042's memory subsystem is its greatest bottleneck, and relative performance drops when algorithms are memory bandwidth or latency bound. It would also be interesting to understand how the CPU's power consumption and energy efficiency compare to alternative HPC solutions, as this is an important consideration for real-world HPC deployments.

Additionally, while the paper compares the SG2042 against x86-64 and AArch64 CPUs, it could have explored how far the observed differences stem from the RISC-V instruction set architecture (ISA) itself rather than from this particular implementation. Understanding the trade-offs and unique characteristics of the RISC-V ISA in the context of HPC workloads would provide valuable insight for researchers and system architects.

Conclusion

The paper presents a comprehensive performance evaluation of the 64-core SG2042 RISC-V CPU for HPC applications using the NAS Parallel Benchmark suite. The results show the SG2042 consistently outperforming other RISC-V CPUs and performing comparatively well against x86-64 and AArch64 CPUs on computationally bound algorithms, while losing ground when algorithms are memory bandwidth or latency bound.

The findings from this research can inform the design and deployment of future RISC-V-based HPC systems, contributing to the ongoing efforts to diversify the HPC landscape beyond traditional x86 and GPU-based architectures. As the RISC-V ecosystem continues to evolve, papers like this one will play an important role in understanding the performance characteristics and potential applications of these emerging CPU designs.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Performance characterisation of the 64-core SG2042 RISC-V CPU for HPC

Nick Brown, Maurice Jamieson

Whilst RISC-V has grown phenomenally quickly in embedded computing, it is yet to gain significant traction in High Performance Computing (HPC). However, as we move further into the exascale era, the flexibility offered by RISC-V has the potential to be very beneficial in future supercomputers especially as the community places an increased emphasis on decarbonising its workloads. Sophon's SG2042 is the first mass produced, commodity available, high-core count RISC-V CPU designed for high performance workloads. First released in summer 2023, and at the time of writing now becoming widely available, a key question is whether this is a realistic proposition for HPC applications. In this paper we use NASA's NAS Parallel Benchmark (NPB) suite to characterise performance of the SG2042 against other CPUs implementing the RISC-V, x86-64, and AArch64 ISAs. We find that the SG2042 consistently outperforms all other RISC-V solutions, delivering between a 2.6 and 16.7 times performance improvement at the single core level. When compared against the x86-64 and AArch64 CPUs, which are commonplace for high performance workloads, we find that the SG2042 performs comparatively well with computationally bound algorithms but decreases in relative performance when the algorithms are memory bandwidth or latency bound. Based on this work, we identify that performance of the SG2042's memory subsystem is the greatest bottleneck.

6/19/2024

RISC-V for HPC: Where we are and where we need to go

Nick Brown

Funded by the UK ExCALIBUR H&ES exascale programme, since early 2022 we have provided a RISC-V testbed for HPC to offer free access for scientific software developers to experiment with RISC-V for their workloads. Based upon our experiences of providing access to RISC-V for the HPC community, and our involvement with the RISC-V community at large, in this extended abstract we summarise the current state of RISC-V for HPC and consider the high priority areas that should be addressed to help drive adoption.

6/19/2024

Distributed astrophysics simulations using Octo-Tiger with RISC-V CPUs using HPX and Kokkos

Patrick Diehl, Panagiotis Syskakis, Gregor Daiß, Steven R. Brandt, Alireza Kheirkhahan, Srinivas Yadav Singanaboina, Dominic Marcello, Chris Taylor, John Leidel, Hartmut Kaiser

In recent years, interest in RISC-V computing architectures has moved from academic to mainstream, especially in the field of High Performance Computing where energy limitations are increasingly a concern. As of this year, the first single board RISC-V CPUs implementing the finalized ratified vector specification are being released. The RISC-V vector specification follows in the tradition of vector processors found in the CDC STAR-100, the Cray-1, the Convex C-Series, and the NEC SX machines and accelerators. The family of vector processors offers support for variable-length array processing as opposed to the fixed-length processing functionality offered by SIMD. Vector processors offer opportunities to perform vector-chaining which allows temporary results to be used without the need to resolve memory references. In this work, we use the Octo-Tiger multi-physics, multi-scale, 3D adaptive mesh refinement astrophysics application to study these early RISC-V chips with vector machine support. We report on our experience in porting this modern C++ code (which is built upon several open-source libraries such as HPX and Kokkos) to RISC-V. In addition, we show the impact of the RISC-V Vector extension on a RISC-V single board computer by implementing the std::experimental::simd interface and integrating it with our code. We also compare the application's performance, scalability, and power consumption on a desktop-grade RISC-V computer to an A64FX system.

8/16/2024

Microarchitectural comparison and in-core modeling of state-of-the-art CPUs: Grace, Sapphire Rapids, and Genoa

Jan Laukemann, Georg Hager, Gerhard Wellein

With Nvidia's release of the Grace Superchip, all three big semiconductor companies in HPC (AMD, Intel, Nvidia) are currently competing in the race for the best CPU. In this work we analyze the performance of these state-of-the-art CPUs and create an accurate in-core performance model for their microarchitectures Zen 4, Golden Cove, and Neoverse V2, extending the Open Source Architecture Code Analyzer (OSACA) tool and comparing it with LLVM-MCA. Starting from the peculiarities and up- and downsides of a single core, we extend our comparison by a variety of microbenchmarks and the capabilities of a full node. The write-allocate (WA) evasion feature, which can automatically reduce the memory traffic caused by write misses, receives special attention; we show that the Grace Superchip has a next-to-optimal implementation of WA evasion, and that the only way to avoid write allocates on Zen 4 is the explicit use of non-temporal stores.

9/14/2024