JpGU-AGU Joint Meeting 2020

講演情報

[E] 口頭発表

セッション記号 H (地球人間圏科学) » H-TT 計測技術・研究手法

[H-TT14] 地理情報システムと地図・空間表現(英語)

コンビーナ:小口 高(東京大学空間情報科学研究センター)、若林 芳樹(首都大学東京大学院都市環境科学研究科)、Yuei-An Liou(National Central University)、Ronald C. Estoque(National Institute for Environmental Studies, Japan)

[HTT14-13] 空間統計解析において標本調査データの利用が及ぼしうる影響についての一考察

★招待講演

*山田 育穂1 (1.東京大学 空間情報科学研究センター)

キーワード:標本調査データ、空間統計解析、標本抽出率、ローカルモラン統計量

Data used in spatial analysis are often based on samples rather than the population. Especially when research interests lie in human related issues such as their attributes, behaviors, and opinions, conducting a questionnaire survey for individuals sampled from areal units, for example, municipalities and ZIP code areas that compose a study region is a frequently used method to generate spatial data about the issues. Because characteristics of the population estimated based on the sample survey data inevitably have some discrepancies from true characteristics of the population, a variety of research and discussion have been done in the field of traditional, non-spatial statistics. However, in the context of spatial statistical analysis, there has been little discussion on potential influence that such sample survey data might have on analysis results, whereas the discrepancies in sample survey data could be more problematic in spatial statistical analysis, which integrates multiple data values collected for individual areal units. This research therefore aims to investigate how the use of data collected through sample surveys potentially influences spatial statistical analysis as the first step to develop methodologies to avoid and/or diminish such influence.

To quantitatively examine potential influence of sample survey data, this research used statistical simulations in which clustering spatial patterns were intentionally generated in a study region and then analyzed by a spatial statistical method to see if/how analysis results with the entire population data and those with sample data selected from it would differ from each other. The study region was a simple, 10x10 grid system, and the statistical method used was local Moran’s I statistic designed to identify spatial autocorrelation around individual areal units in the study region. Systematically designed 8 types of clustering spatial patterns were examined with 5 levels of sampling rates. Results suggested that statistical power of local Moran’s I analysis to detect clustering patterns decreased as the sampling rates decreased, and the sampling rates of less than or equal to 0.01 seem to be critically problematic. It was also suggested that strength of such influence varied depending on the types of clustering patterns and relative locations of areal units in the study region. As these results confirmed that the use of sample survey data actually influence results of spatial statistical analysis, further research is needed to obtain more generic, detailed properties of such influence for contributing to development of methodologies to avoid and/or diminish it.