17:15 〜 19:15
[MGI31-P03] Implementation and Evaluation of GPU Parallelization for Geophysical Simulations

Keywords: parallelization, GPGPU
Numerical simulations play a decisive role in validating theories and computational methods across scientific and engineering fields. However, large-scale simulations require long execution times due to their heavy computational demands. Parallel computing techniques, particularly the use of Graphics Processing Units (GPUs), have been introduced to address this issue. With their massively parallel architecture, GPUs enable significant speedups over traditional CPU-based computations. Porting a CPU simulation code to GPUs, however, is not straightforward, and one key issue is choosing an appropriate API for GPU computing.
CUDA is a widely used API for GPU computing. It provides low-level control over hardware resources to maximize performance, but porting a code to CUDA requires extensive modifications. In contrast, directive-based approaches such as OpenMP and OpenACC allow GPU utilization with minimal code changes, reducing the programming effort. Despite these advantages, comparative analysis of their performance, portability, and ease of implementation remains limited. Information on the specific optimizations needed for directive-based approaches is also still insufficient.
This study evaluates CUDA, OpenMP, and OpenACC by applying them to two simulation codes: a magnetohydrodynamics (MHD) simulation and an atmospheric dynamics simulation. First, we compare the three programming methods on the MHD simulation, analyzing both execution time and the complexity of the required code modifications. Next, we evaluate the atmospheric dynamics code that we modified for GPU execution, and compare the performance of the GPU- and CPU-oriented codes. Furthermore, because the MHD and atmospheric simulations differ in computational models and data structures, we analyze how these differences affect GPU optimization.