Japan Geoscience Union Meeting 2025

Presentation information

[J] Oral

H (Human Geosciences ) » H-GM Geomorphology

[H-GM04] Geomorphology

Mon. May 26, 2025 9:00 AM - 10:30 AM 103 (International Conference Hall, Makuhari Messe)

convener:Junko Iwahashi(GSI of Japan), Hitoshi SAITO(Graduate School of Environmental Studies, Nagoya University), Shintaro Takanami(University of Tsukuba), Daniel R Newman(Hokkaido University), Chairperson:Shintaro Takanami(Meiji University)


9:45 AM - 10:00 AM

[HGM04-04] DEMFLUX Geology-to-Terrain Model: Automatic Terrain Generation by Generative AI Trained on Actual Topography

*Fumihiko Ikegami1 (1.Ikegami GeoResearch)

Keywords:Large-scale topography, Plate Tectonics, Generative AI

Automatic terrain generation has a long history, dating back to early procedural methods such as Perlin noise and fractal algorithms in the late 20 centuries. While software like Terragen was popular at the time, these approaches often produced repetitive or fractal patterns and neglected key geological processes, particularly erosion and the diversity of rock types, faults, and volcanic features. Consequently, these early techniques lacked realism. Although the introduction of river drainage simulations in the 2010s provided slight improvements, the outputs still fell short of capturing geological consistency. In the 2020s, however, neural network–based approaches started to enable the generation of more diverse and geologically informed terrains, in contrast to the limitations of earlier procedural methods.

In this study, I developed large-scale (2,000 × 2,000 km and 4,000 × 4,000 km) terrain models using a geologically trained text-to-image diffusion model. This work builds upon FLUX (Black Forest Lab, 2023), which was fine-tuned with the GEBCO 2023 Grid (GEBCO, 2023) and my own geological annotations. The training dataset consisted of 644 grayscale images of actual Earth topography, each paired with a corresponding text file describing the geological context. The dataset was augmented through systematic rotations of the original topographic images, increasing sample diversity.

Qualitative evaluation suggests that the resulting terrain captures essential aspects of plate tectonics, including the bimodal distribution of oceanic and continental crust, oceanic ridges, transform faults, subduction zones, and orogenic belts. The generated terrains realistically transition among convergent, divergent, and strike-slip boundaries. However, resolution constraints and limited training data occasionally hinder the model’s ability to generate finer-scale features such as volcanic fronts, accretionary prisms, and complex interactions at triple junctions. Additionally, flat plains sometimes exhibit noise, likely due to the dynamic range constraints of 8-bit grayscale images.

Overall, this diffusion-based model demonstrates promise for realistic and geologically coherent terrain generation. It also highlights the potential of linking geological terminology directly to resultant topographic forms. Future work will focus on integrating additional constraints, such as those provided by ControlNet (Illyasviel, 2022), to allow greater control over features like plate boundary geometry, which could be particularly useful for paleogeographic reconstructions.