RISC-V for HPC: Where we are and where we need to go

Read original: arXiv:2406.12398 - Published 6/19/2024 by Nick Brown
Total Score

0

๐Ÿงช

Sign in to get full access

or

If you already have an account, we'll log you in

Overview

  • RISC-V is a free and open-source instruction set architecture (ISA) that has gained significant attention in the high-performance computing (HPC) community.
  • This paper discusses the current state of RISC-V in HPC and the areas that still need to be addressed to make it a viable option for large-scale HPC systems.

Plain English Explanation

RISC-V is a new type of computer chip design that is open for anyone to use and improve. This is different from the more common chip designs, which are owned by specific companies. The paper looks at how RISC-V is being used in high-performance computing (HPC), which is the field of building very powerful computers for things like scientific research and weather forecasting.

The paper explains where RISC-V is at right now in terms of HPC, and what still needs to be done to make it a good choice for building the biggest and most powerful HPC systems. It covers the current capabilities of RISC-V, as well as the areas that need more work, like performance-characterisation-64-core-sg2042-risc-v, full-stack-evaluation-machine-learning-inference-workloads, hpc-alongside-user-space-kubernetes, fork-is-all-you-needed-heterogeneous-systems, and vectorization-gradient-boosting-decision-trees-prediction-catboost.

Technical Explanation

The paper provides an overview of the current state of RISC-V in the HPC domain. It discusses the key features and capabilities of RISC-V that make it a promising alternative to traditional CPU architectures for HPC applications.

The paper highlights several research efforts that have explored the performance of RISC-V-based systems for HPC workloads, such as performance-characterisation-64-core-sg2042-risc-v, full-stack-evaluation-machine-learning-inference-workloads, and hpc-alongside-user-space-kubernetes. These studies provide insights into the strengths and limitations of RISC-V in terms of performance, scalability, and integration with HPC software stacks.

The paper also discusses the role of heterogeneous computing and the potential of RISC-V to enable fork-is-all-you-needed-heterogeneous-systems for HPC workloads. Additionally, it touches on the importance of efficient vector processing and the work being done in areas like vectorization-gradient-boosting-decision-trees-prediction-catboost to optimize RISC-V for HPC applications.

Critical Analysis

The paper provides a comprehensive overview of the current state of RISC-V in the HPC domain, highlighting both the progress made and the challenges that still need to be addressed. One potential limitation is that the paper does not delve deeply into the specific performance characteristics and trade-offs of RISC-V compared to other CPU architectures commonly used in HPC, such as x86 or ARM. Additionally, the paper could have explored the potential impact of RISC-V on the broader HPC ecosystem, including the implications for software development, system integration, and the competitive landscape.

Despite these minor limitations, the paper offers a valuable contribution to the ongoing discussion around the role of RISC-V in HPC. It encourages readers to think critically about the capabilities and limitations of RISC-V, and to consider the areas that require further research and development to make RISC-V a viable option for large-scale HPC systems.

Conclusion

The paper provides a comprehensive overview of the current state of RISC-V in the HPC domain, highlighting both the progress made and the challenges that still need to be addressed. It covers a range of research efforts that have explored the performance and integration of RISC-V-based systems for HPC workloads, demonstrating the potential of this open-source architecture to disrupt the traditional HPC landscape.

While RISC-V has made significant strides in recent years, the paper underscores the need for continued research and development to address critical areas such as performance, scalability, and software ecosystem integration. By addressing these challenges, the RISC-V community can work towards making it a more attractive and viable option for large-scale HPC systems, with potential implications for scientific research, weather forecasting, and other data-intensive applications.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on ๐• โ†’

Related Papers

๐Ÿงช

Total Score

0

RISC-V for HPC: Where we are and where we need to go

Nick Brown

Funded by the UK ExCALIBUR H&ES exascale programme, since early 2022 we have provided a RISC-V testbed for HPC to offer free access for scientific software developers to experiment with RISC-V for their workloads. Based upon our experiences of providing access to RISC-V for the HPC community, and our involvement with the RISC-V community at large, in this extended abstract we summarise the current state of RISC-V for HPC and consider the high priority areas that should be addressed to help drive adoption.

Read more

6/19/2024

Performance characterisation of the 64-core SG2042 RISC-V CPU for HPC
Total Score

0

Performance characterisation of the 64-core SG2042 RISC-V CPU for HPC

Nick Brown, Maurice Jamieson

Whilst RISC-V has grown phenomenally quickly in embedded computing, it is yet to gain significant traction in High Performance Computing (HPC). However, as we move further into the exascale era, the flexibility offered by RISC-V has the potential to be very beneficial in future supercomputers especially as the community places an increased emphasis on decarbonising its workloads. Sophon's SG2042 is the first mass produced, commodity available, high-core count RISC-V CPU designed for high performance workloads. First released in summer 2023, and at the time of writing now becoming widely available, a key question is whether this is a realistic proposition for HPC applications. In this paper we use NASA's NAS Parallel Benchmark (NPB) suite to characterise performance of the SG2042 against other CPUs implementing the RISC-V, x86-64, and AArch64 ISAs. We find that the SG2042 consistently outperforms all other RISC-V solutions, delivering between a 2.6 and 16.7 performance improvement at the single core level. When compared against the x86-64 and AArch64 CPUs, which are commonplace for high performance workloads, we find that the SG2042 performs comparatively well with computationally bound algorithms but decreases in relative performance when the algorithms are memory bandwidth or latency bound. Based on this work, we identify that performance of the SG2042's memory subsystem is the greatest bottleneck.

Read more

6/19/2024

๐Ÿ”

Total Score

0

Distributed astrophysics simulations using Octo-Tiger with RISC-V CPUs using HPX and Kokkos

Patrick Diehl, Panagiotis Syskakis, Gregor Dai{ss}, Steven R. Brandt, Alireza Kheirkhahan, Srinivas Yadav Singanaboina, Dominic Marcello, Chris Taylor, John Leidel, Hartmut Kaiser

In recent years, interest in RISC-V computing architectures has moved from academic to mainstream, especially in the field of High Performance Computing where energy limitations are increasingly a concern. As of this year, the first single board RISC-V CPUs implementing the finalized ratified vector specification are being released. The RISC-V vector specification follows in the tradition of vector processors found in the CDC STAR-100, the Cray-1, the Convex C-Series, and the NEC SX machines and accelerators. The family of vector processors offers support for variable-length array processing as opposed to the fixed-length processing functionality offered by SIMD. Vector processors offer opportunities to perform vector-chaining which allows temporary results to be used without the need to resolve memory references. In this work, we use the Octo-Tiger multi-physics, multi-scale, 3D adaptive mesh refinement astrophysics application to study these early RISC-V chips with vector machine support. We report on our experience in porting this modern C++ code (which is built upon several open-source libraries such as HPX and Kokkos) to RISC-V. In addition, we show the impact of the RISC-V Vector extension on a RISC-V single board computer by implementing the std::experimental:simd interface and integrating it with our code. We also compare the application's performance, scalability, and power consumption on desktop-grade RISC-V computer to an A64FX system.

Read more

8/16/2024

Enabling an OpenStack-based cloud on top of RISC-V hardware
Total Score

0

Enabling an OpenStack-based cloud on top of RISC-V hardware

Diego Marr'on, Aaron Call, Josep Ll. Berral, Ramon Nou

The European Union's technological sovereignty strategy centers around the RISC-V Instruction Set Architecture, with the European Processor Initiative leading efforts to build production-ready processors. Focusing on realizing a functional RISC-V cloud ecosystem, the Vitamin-V European project developed an OpenStack cluster utilizing genuine hardware. In this poster, we detail the efforts done in porting and setting up the cluster and the many software services required by OpenStack to properly run on real hardware. In this poster, we detail our efforts on building an minimal viable prototype OpenStack cluster using real hardware. The cluster is almost functional, and we expect it to be complete in the next few months.

Read more

7/18/2024