Japan Geoscience Union Meeting 2021

Presentation information

[E] Poster presentation

Session symbol A (Atmospheric and Hydrospheric Sciences) » A-CG Atmospheric, Oceanic, and Environmental Sciences: Complex and General

[A-CG36] Satellite Earth Environment Observation

Thursday, June 3, 2021, 17:15 〜 18:30 Ch.06

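Conveners: Riko Oki (Japan Aerospace Exploration Agency), Yoshiaki Honda (Center for Environmental Remote Sensing, Chiba University), Yukari Takayabu (Atmosphere and Ocean Research Institute, The University of Tokyo), Tsuneo Matsunaga (Center for Global Environmental Research and Satellite Observation Center, National Institute for Environmental Studies)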

17:15 〜 18:30

[ACG36-P15] Comparison between random forest and multiple linear regression algorithms for digital soil mapping in the Thung Kula Ronghai region, Thailand

*Sasirin Srisomkiew1,2, Masayuki Kawahigashi1, Pitayakon Limtong2 (1. Department of Geography, Tokyo Metropolitan University, Tokyo 192-0397, Japan; 2. Land Development Department, Ministry of Agriculture and Cooperatives, Bangkok 10900, Thailand)

Keywords: Predictor variables, Remote sensing, Spatial distribution, Spectral and terrain indices, Soil property

Digital soil mapping (DSM) increasingly relies on machine learning (ML) algorithms to identify relationships between soil properties and environmental variables, which makes it possible to predict soil nutrient levels. Over the past decades, many studies have employed the multiple linear regression (MLR) algorithm to estimate the spatial distribution of soil chemical properties in various landscapes. The Thung Kula Ronghai (TKR) region is an important agricultural area that produces high-quality jasmine rice, which has been registered as a Protected Geographical Indication (PGI) by the European Union. However, the rice yield in this region is lower than in other regions of the country. In this study, we compare random forest (RF), which is the most popular ML algorithm for digital soil mapping, with MLR for mapping the spatial distribution of soil chemical properties in the TKR region, Thailand. The algorithms were compared on the basis of three factors: (1) model accuracy, (2) selection of predictor variables, and (3) the spatial distribution characteristics of soil properties. The dataset consisted of 186 soil samples collected from the surface layer (0-30 cm) and analyzed for nutrients. Landsat-8 images with 30 m resolution, acquired under bare-land conditions, were used to calculate the spectral indices. A digital elevation model with 5 m resolution was used to derive the terrain variables of the study area. Soil properties were estimated from the predictor variables using MLR as a simple model and RF as a complex model. Ten-fold cross-validation was used to determine model accuracy. The RF and MLR models were evaluated in terms of the coefficient of determination, root mean square error, and normalized root mean square error. The results demonstrated that both the RF and MLR models successfully produced digital soil maps of various soil properties.
The spectral indices for brightness, saturation, coloration, normalized difference water, and moisture stress were important predictor variables and were significantly correlated with various soil properties. The RF predictions were more accurate than those of MLR for most soil properties. The RF model also produced more realistic results in terms of the correlation between predicted and measured soil data, indicating that RF is more appropriate for producing digital maps of soil chemical properties in the TKR region.
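
The evaluation procedure named in the abstract (ten-fold cross-validation scored by the coefficient of determination, root mean square error, and normalized root mean square error) can be illustrated with a minimal sketch. This is not the authors' code. The function names are hypothetical, the data are synthetic, the single-predictor least-squares model is only a stand-in for the RF and MLR models, and dividing the RMSE by the observed range is an assumed normalization, since the abstract does not state which one was used.

```python
import math
import random


def rmse(observed, predicted):
    """Root mean square error."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


def nrmse(observed, predicted):
    """RMSE divided by the observed range (assumed normalization)."""
    return rmse(observed, predicted) / (max(observed) - min(observed))


def r_squared(observed, predicted):
    """Coefficient of determination, 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle sample indices and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def cross_validated_predictions(x, y, fit, predict, k=10, seed=0):
    """Predict every sample with a model trained on the other k - 1 folds."""
    predictions = [None] * len(y)
    for test_idx in k_fold_indices(len(y), k, seed):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(y)) if i not in held_out]
        model = fit([x[i] for i in train_idx], [y[i] for i in train_idx])
        for i in test_idx:
            predictions[i] = predict(model, x[i])
    return predictions


def fit_line(xs, ys):
    """Stand-in model: ordinary least squares with a single predictor."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((a - mx) ** 2 for a in xs)
    slope = sum((a - mx) * (b - my) for a, b in zip(xs, ys)) / sxx
    return slope, my - slope * mx


def predict_line(model, x_value):
    """Evaluate the fitted line at one predictor value."""
    slope, intercept = model
    return slope * x_value + intercept


# Synthetic data only; 186 points mirror the sample count in the abstract.
rng = random.Random(42)
x = [rng.uniform(0.0, 1.0) for _ in range(186)]
y = [2.0 * xi + 0.5 + rng.gauss(0.0, 0.1) for xi in x]

pred = cross_validated_predictions(x, y, fit_line, predict_line)
print(f"R2={r_squared(y, pred):.3f} RMSE={rmse(y, pred):.3f} NRMSE={nrmse(y, pred):.3f}")
```

Because `cross_validated_predictions` takes the `fit` and `predict` callables as arguments, the same folds and metrics could be reused for any pair of models, which is what makes a like-for-like comparison such as RF versus MLR possible.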