日本地球惑星科学連合2014年大会

講演情報

口頭発表

セッション記号 P (宇宙惑星科学) » P-EM 太陽地球系科学・宇宙電磁気学・宇宙環境

[P-EM35_2PM2] プラズマ宇宙:シミュレーション技法,データ解析・可視化

2014年5月2日(金) 16:15 〜 17:00 503 (5F)

コンビーナ:*松清 修一(九州大学大学院総合理工学研究院流体環境理工学部門)、松元 亮治(千葉大学)、座長:三好 隆博(広島大学大学院理学研究科物理科学専攻)

16:30 〜 16:45

[PEM35-05] 磁気圏MHDシミュレーションコードのXeon Phiに対する最適化

*深沢 圭一郎1 (1.九州大学情報基盤研究開発センター)

For investigating the global structures of plasma, such as the planetary magnetospheres, the Magnetohydrodynamic (MHD) equations are often used, in which full kinetics of plasma are neglected by taking the moments of the Vlasov equations. The MHD equations are highly nonlinear and are very complex to solve by hand calculations. Thus computer simulations play essential roles in studies of global magnetosphere.The numerical MHD code for the magnetosphere has been optimized for vector-type supercomputers for a long time because the MHD code is a kind of fluid code and most of supercomputers with vector processors have high performance to solve the fluid codes in 1990's. These codes often have achieved a very high computational efficiency (the ratio of the effective performance to the theoretical performance). However, almost 100% of the "Top 500" supercomputer systems in the world adopt the massively parallel scalar type processors and more than 85% of systems consist of the x86 processor architecture in these days. The other scalar type computers are POWER and SPARC architectures. Recently the new coprocessor Xeon Phi which has many cores (~60 cores) of X86 architecture is introduced to the supercomputer system and achieved good performance of the Linpack benchmark. In this study I evaluate the performance of MHD code for the planetary magnetosphere on the single Xeon Phi coprocessor. For the performance evaluation, I use the three-dimensional domain decomposition method and a cache-hit type of three-dimensional domain decomposition method with the flat MPI and hybrid MPI. As the results, I found that normal three-dimensional decomposition of the MHD model with the hybrid MPI is suitable for Xeon Phi coprocessor and achieved computing performance efficiency of ~7%. Furthermore, I add the optimization to the MHD code based on the Xeon Phi architecture and obtained the computing performance efficiency of almost 10% which is the double performance of FX10 single node.