IGSNRR OpenIR
Evaluation of conditioned Latin hypercube sampling for soil mapping based on a machine learning method
Yang, Lin1,2; Li, Xinming2,3; Shi, Jingjing2; Shen, Feixue1; Qi, Feng4; Gao, Binbo5; Chen, Ziyue6; Zhu, A-Xing2,3,7; Zhou, Chenghu1,2
2020-06-15
Source PublicationGEODERMA
ISSN0016-7061
Volume369Pages:15
Corresponding AuthorChen, Ziyue(zychen@bnu.edu.cn)
AbstractSampling design plays an important role in soil survey and soil mapping. Conditioned Latin hypercube sampling (cLHS) has been proven as an efficient sampling strategy and used widely in digital soil mapping. cLHS samples are randomly selected in each stratum of environmental variables, thus the produced sample sets can vary significantly at different runs with the same sample size. Although variation of mapping accuracies caused by the randomness of cLHS has been realized and qualitatively mentioned in past studies. However, how the randomness of cLHS could quantitatively influence mapping accuracy has rarely been examined. In this study, we conducted experiments to examine how the sample randomness quantitatively influence soil mapping accuracy with different sample sizes, and analyzed the possible reasons from a pedogenesis perspective. The results showed that the largest range of mapping accuracies of 500 repeats was 39.5% at a sample density of 2.59 point/ km(2), while the smallest range was 7.3% at the maximum sample size with a sample density of 32.47 point/km(2). The sample density for satisfactory prediction accuracies in our study area was at least 10.06 Point/km(2). The results showed that both the allocation of sample points to each soil series and the typicality of sample points played important roles in mapping accuracies. But the deep reasons causing the unstable performance of cLHS at small sample sizes were the imbalanced class distribution of soil series and the overlap between soil series in the distribution of environmental covariates. Researchers need to be cautious about the output when applying cLHS with small sampling densities. Some effective approaches to address this issue include increasing the sample size, checking the sample allocations of a cLHS design with the assistance of legacy soil maps, or adding the legacy soil map as a variable during sampling design. When the sampling resources and legacy soil maps are limited for an area, fuzzy k-means clustering sampling could be a potential alternative. This study provides useful references for better understanding the uncertainty of cLHS when the sample density is small and selecting alternative sampling methods accordingly.
KeywordConditioned Latin hypercube sampling Soil mapping Representativeness Sample randomness
DOI10.1016/j.geoderma.2020.114337
WOS KeywordRANDOM FORESTS ; DESIGN ; MODEL ; CLASSIFICATION ; PREDICTION ; STOCKS ; OPTIMIZATION ; VARIABILITY ; VALIDATION ; REGRESSION
Indexed BySCI
Language英语
Funding ProjectNational Natural Science Foundation of China[41971054] ; National Natural Science Foundation of China[41530749] ; Leading Funds for the First class Universities[020914912203] ; Leading Funds for the First class Universities[020914902302]
Funding OrganizationNational Natural Science Foundation of China ; Leading Funds for the First class Universities
WOS Research AreaAgriculture
WOS SubjectSoil Science
WOS IDWOS:000524458800004
PublisherELSEVIER
Citation statistics
Cited Times:8[WOS]   [WOS Record]     [Related Records in WOS]
Document Type期刊论文
Identifierhttp://ir.igsnrr.ac.cn/handle/311030/133860
Collection中国科学院地理科学与资源研究所
Corresponding AuthorChen, Ziyue
Affiliation1.Nanjing Univ, Sch Geog & Ocean Sci, Nanjing 210023, Peoples R China
2.Chinese Acad Sci, State Key Lab Resources & Environm Informat Syst, Inst Geog Sci & Nat Resources Res, Beijing 100101, Peoples R China
3.Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
4.Kean Univ, Sch Environm & Sustainabil Sci, Union, NJ 07083 USA
5.China Agr Univ, Coll Land Sci & Technol, Tsinghua East Rd, Beijing 100083, Peoples R China
6.Beijing Normal Univ, Coll Global Change & Earth Syst Sci, 19 Xinjiekouwai St, Beijing 100083, Peoples R China
7.Nanjing Normal Univ, Key Lab Virtual Geog Environm, Minist Educ, Nanjing 210023, Peoples R China
Recommended Citation
GB/T 7714
Yang, Lin,Li, Xinming,Shi, Jingjing,et al. Evaluation of conditioned Latin hypercube sampling for soil mapping based on a machine learning method[J]. GEODERMA,2020,369:15.
APA Yang, Lin.,Li, Xinming.,Shi, Jingjing.,Shen, Feixue.,Qi, Feng.,...&Zhou, Chenghu.(2020).Evaluation of conditioned Latin hypercube sampling for soil mapping based on a machine learning method.GEODERMA,369,15.
MLA Yang, Lin,et al."Evaluation of conditioned Latin hypercube sampling for soil mapping based on a machine learning method".GEODERMA 369(2020):15.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Yang, Lin]'s Articles
[Li, Xinming]'s Articles
[Shi, Jingjing]'s Articles
Baidu academic
Similar articles in Baidu academic
[Yang, Lin]'s Articles
[Li, Xinming]'s Articles
[Shi, Jingjing]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Yang, Lin]'s Articles
[Li, Xinming]'s Articles
[Shi, Jingjing]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.