Japan Geoscience Union Meeting 2023

Presentation information

[J] Oral

S (Solid Earth Sciences ) » S-TT Technology & Techniques

[S-TT43] Creating future of solid Earth science with high performance computing (HPC)

Mon. May 22, 2023 9:00 AM - 10:15 AM International Conference Room (IC) (International Conference Hall, Makuhari Messe)

convener:Takane Hori(Japan Agency for Marine-Earth Science and Technology), Yuji Yagi(Graduate School of Life and Environmental Sciences, University of Tsukuba, Tsukuba), Katsuhiko Shiomi(National Research Institute for Earth Science and Disaster Resilience), Takanori Matsuzawa(National Research Institute for Earth Science and Disaster Resilience), Chairperson:Takane Hori(Japan Agency for Marine-Earth Science and Technology), Katsuhiko Shiomi(National Research Institute for Earth Science and Disaster Resilience)

9:00 AM - 9:15 AM

[STT43-01] Optimizations of H-matirx-vector Multiplication for Many-core Processors

★Invited Papers

*Tetsuya Hoshino1, Akihiro Ida3, Toshihiro Hanawa2 (1.Nagoya University, 2.The University of Tokyo, 3.Japan Agency for Marine-Earth Science and Technology)

Keywords:H-matrix, Many-core

Hierarchical matrices (H-matrices) can robustly approximate the dense matrices that appear in the boundary element method (BEM). To accelerate the solving of linear systems in the BEM, we must speed up the matrix–vector multiplication in the iterative linear solver. However, speed-up approaches are usually developed for dense or sparse matrices, and are rarely reported for hierarchical matrix–vector multiplication ( HiMV). The HiMV algorithm generates a large number of matrix–vector multiplications, which have not been sufficiently discussed. Therefore, the efficiency of HiMV has not reached its potential. This presentation discusses optimization methodologies of HiMV for modern multi/many-core CPUs or GPUs: an H-matrix storage method for efficient memory access, a method that avoids write contentions during reduction operations on the solution vector, an inter-thread load-balancing method, and blocking and sub-matrix sorting methods for cache efficiency.