Japan Geoscience Union Meeting 2023

Presentation information

[J] Online Poster

P (Space and Planetary Sciences ) » P-EM Solar-Terrestrial Sciences, Space Electromagnetism & Space Environment

[P-EM17] Space Plasma Physics: Theory and Simulation

Tue. May 23, 2023 9:00 AM - 10:30 AM Online Poster Zoom Room (2) (Online Poster)

convener:Takanobu Amano(Department of Earth and Planetary Science, University of Tokyo), Yohei Miyake(Graduate School of System Informatics, Kobe University), Takayuki Umeda(Institute for Space-Earth Environmental Research, Nagoya University), Tadas Nakamura(Fukui Prefectural University)

On-site poster schedule(2023/5/22 17:15-18:45)

9:00 AM - 10:30 AM

[PEM17-P13] Performance Evaluation of Load-balanced Particle-based Plasma Kinetic Simulations

*Takanobu Amano1, Yosuke Matsumoto2 (1.Department of Earth and Planetary Science, University of Tokyo, 2.Institute for Advanced Academic Resaerch, Chiba University)

Keywords:plasma, numerical simulation, Particle-In-Cell method

Plasma kinetic simulation has been an indispensable tool for laboratory, space, and astrophysical plasma modeling for a long time. To mitigate the inherently-large computational demand in solving Vlasov or Boltzmann equation in six-dimensional phase space, the standard Particle-In-Cell (PIC) scheme represents the phase space density using aggregation of computational particles. On the other hand, the electromagnetic field is defined on a mesh. The computational particles interact with the mesh quantities but move freely in continuous phase space across the mesh boundaries. The difference in the data structures between particles and fields makes it challenging to fully exploit the performance of modern supercomputers, which typically consist of a hierarchy of parallelism.

We have recently developed a general dynamic load-balancing framework for particle-based kinetic simulations in C++. It is based on decomposing the computational domain into small chunks [e.g., Germaschewski et al., 2016, Derouillat et al., 2018, Rowan et al., 2021]. The load balancing is performed by distributing the chunks into MPI ranks so that each rank's computational load becomes equal as much as possible. Therefore, the chunk size and the number of chunks per MPI process are essential parameters controlling the efficiency of dynamic load balancing. Furthermore, it has been well-recognized that the chunk size also affects the single-core performance because a smaller chunk makes it possible to use CPU cash more efficiently.

The motivation of this work is to evaluate the performance of a standard PIC code built on top of the newly developed framework. We evaluate the load-balancing performance as a function of chunk size for problems with highly-inhomogeneous density, such as magnetic reconnection. Flat-MPI and OpenMP-MPI hybrid parallelization strategies will be tested on Intel Xeon and Fujitsu A64FX architectures. SIMD optimization and efficient cash usage may also be discussed.