Japan Geoscience Union Meeting 2024

Presentation information

[E] Oral presentation

Session symbol A (Atmospheric and Hydrospheric Sciences) » A-AS Atmospheric Sciences, Meteorology & Atmospheric Environment

[A-AS02] Weather, Climate, and Environmental Science Enabled by High-Performance Computing

Wed. May 29, 2024, 15:30-16:45, Room 103 (International Conference Hall, Makuhari Messe)

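Conveners: Hisashi Yashiro (National Institute for Environmental Studies), Masuo Nakano (Japan Agency for Marine-Earth Science and Technology), Takuya Kawabata (Meteorological Research Institute), Tomoki Miyakawa (Atmosphere and Ocean Research Institute, The University of Tokyo); Chairpersons: Tomoki Miyakawa (Atmosphere and Ocean Research Institute, The University of Tokyo), Hisashi Yashiro (National Institute for Environmental Studies)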


15:30-15:45

[AAS02-07] Data access for km-scale resolution models

★Invited Papers

*Florian Andreas Ziemen¹, Tobias Kölling², Lukas Kluft² (1. Deutsches Klimarechenzentrum GmbH, Hamburg, Germany; 2. Max Planck Institute for Meteorology, Hamburg, Germany)

Keywords: Climate Modeling, Data, Workflows, High-resolution, km-scale

With the transition to global, km-scale simulations, model outputs have grown in size, and efficient ways of accessing data have become more important than ever. This means that data storage has to be optimized for efficient read access to small subsets of the data, and that multiple resolutions of the same dataset need to be provided for efficient analysis at both coarse and fine scales.
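To illustrate why chunked storage supports efficient access to small subsets, the following minimal Python sketch computes which chunks a reader would have to fetch for a given index range. It assumes a regular chunk grid and the dot-separated chunk keys used by default in the Zarr v2 format; the function name `chunk_keys`, the array layout (time, cell) and the chunk sizes are hypothetical and are not taken from the work presented here.

```python
from itertools import product


def chunk_keys(selection, chunk_shape):
    """Return the chunk keys (e.g. "0.1") touched by an index selection.

    selection:   one (start, stop) pair per dimension, stop exclusive.
    chunk_shape: number of elements per chunk along each dimension.
    """
    per_dim = []
    for (start, stop), size in zip(selection, chunk_shape):
        first = start // size        # index of the first chunk touched
        last = (stop - 1) // size    # index of the last chunk touched
        per_dim.append(range(first, last + 1))
    return [".".join(str(i) for i in idx) for idx in product(*per_dim)]


# Hypothetical layout: (time, cell), chunked as 24 time steps x 4096 cells.
keys = chunk_keys([(0, 24), (5000, 9000)], (24, 4096))
print(keys)
```

Only the chunks listed in `keys` need to be read, regardless of the total size of the array, which is the property that makes access to small regions inexpensive.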
We present an approach based on datasets. Each dataset represents a coherent subset of a model output (e.g. all model variables stored at daily resolution). Aiming for a minimum number of datasets drives us to enforce consistency in the model output, which in turn eases analysis. Each dataset is served to the user as one Zarr store, independent of the actual file layout on disk or other storage media. Multiple datasets are grouped in catalogs for findability.
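The relation between datasets and catalogs described above can be sketched in a few lines of Python. This is an illustration only: the class names `Dataset` and `Catalog`, the attribute names, and the `example.org` URLs are assumptions made for this sketch, not the interface of the system presented here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Dataset:
    """One coherent subset of a model output, served as a single Zarr store."""
    name: str
    time_step: str  # e.g. "daily"; the output interval shared by all variables
    url: str        # hypothetical HTTPS address of the Zarr store


class Catalog:
    """Groups datasets so that users can find them by name or by attribute."""

    def __init__(self, datasets):
        self._by_name = {d.name: d for d in datasets}

    def __getitem__(self, name):
        return self._by_name[name]

    def search(self, **criteria):
        """Return datasets matching all given attribute values, sorted by name."""
        matches = (
            d for d in self._by_name.values()
            if all(getattr(d, key) == value for key, value in criteria.items())
        )
        return sorted(matches, key=lambda d: d.name)


catalog = Catalog([
    Dataset("run1_daily", "daily", "https://example.org/run1/daily.zarr"),
    Dataset("run1_hourly", "hourly", "https://example.org/run1/hourly.zarr"),
    Dataset("run2_daily", "daily", "https://example.org/run2/daily.zarr"),
])

daily = [d.name for d in catalog.search(time_step="daily")]
print(daily)
print(catalog["run1_hourly"].url)
```

The user only sees a name and a URL per dataset; how the underlying files are laid out on storage does not appear in the catalog at all.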
By serving the data via HTTPS, we implement a middle layer between the user and the storage systems, which allows us to combine different storage backends behind a unifying frontend. At the same time, this approach allows us to build the system largely on existing technologies such as web servers and caches, and to serve data efficiently to users outside DKRZ.
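The idea of a unifying frontend in front of several storage backends, combined with a cache, can be sketched as follows. This is a simplified in-memory model under stated assumptions, not the implementation presented here: the `Frontend` class, the path prefixes `/recent/` and `/archive/`, and the dictionaries standing in for storage systems are all hypothetical, and a real deployment would rely on a web server and an HTTP cache instead.

```python
class Frontend:
    """Routes request paths to storage backends and caches the responses."""

    def __init__(self, backends):
        # Maps a path prefix to a function that fetches bytes from one backend.
        self._backends = backends
        self._cache = {}

    def get(self, path):
        if path in self._cache:  # served from cache, backend not contacted
            return self._cache[path]
        for prefix, fetch in self._backends.items():
            if path.startswith(prefix):
                data = fetch(path[len(prefix):])
                self._cache[path] = data
                return data
        raise KeyError(path)


# Two dictionaries stand in for different storage systems (assumption).
disk = {"daily.zarr/tas/0.0": b"disk-chunk"}
archive = {"daily.zarr/tas/0.0": b"archive-chunk"}
calls = []  # records which backend was actually contacted


def make_fetcher(label, store):
    def fetch(key):
        calls.append(label)
        return store[key]
    return fetch


frontend = Frontend({
    "/recent/": make_fetcher("disk", disk),
    "/archive/": make_fetcher("archive", archive),
})

first = frontend.get("/recent/daily.zarr/tas/0.0")
again = frontend.get("/recent/daily.zarr/tas/0.0")  # answered by the cache
old = frontend.get("/archive/daily.zarr/tas/0.0")
print(calls)
```

The client uses one uniform path scheme for both requests, and the repeated request never reaches the backend, which is the role that existing web caches would play in such a setup.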
The approach we present is currently under development in the WarmWorld, nextGEMS and EERIE projects, and we expect it to be useful for many other projects as well.