首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于多源环境变量和随机森林的橡胶园土壤全氮含量预测
引用本文:郭澎涛,李茂芬,罗微,林清火,唐群锋,刘志崴.基于多源环境变量和随机森林的橡胶园土壤全氮含量预测[J].农业工程学报,2015,31(5):194-202.
作者姓名:郭澎涛  李茂芬  罗微  林清火  唐群锋  刘志崴
作者单位:1. 中国热带农业科学院橡胶研究所,儋州 571737;,2. 中国热带农业科学院科技信息研究所,儋州 571737;,1. 中国热带农业科学院橡胶研究所,儋州 571737;,1. 中国热带农业科学院橡胶研究所,儋州 571737;,3. 海南农垦科学院,海口 570206;,3. 海南农垦科学院,海口 570206;
基金项目:国家科技支撑计划课题(2009BADA1B04);国家天然橡胶产业技术体系(CARS-34)
摘    要:土壤全氮与土壤肥力和土壤氮循环紧密相关。掌握土壤全氮详细的空间分布信息对提高土壤肥力管理效率和更好地了解土壤氮循环至关重要。该文以儋州国营橡胶园为研究区域,采集2511个土壤样品,利用随机森林(random forest,RF)、逐步线性回归(stepwise linear regression,SLR)、广义加性混合模型(generalized additive mixed model,GAMM)以及分类回归树(classification and regression tree,CART)结合多源环境变量(成土母质、平均降雨量、平均气温和归一化植被指数)对研究区橡胶园土壤全氮含量进行空间预测,并通过754个独立验证点比较了4种模型的预测精度。结果表明RF对土壤全氮的预测值和实测值的相关系数(0.82)明显高于SLR(0.68)、GAMM(0.70)和CART(0.69),而RF的预测平均绝对误差(0.08836 g/kg)和均方根误差(0.13090 g/kg)均低于SLR、GAMM和CART。此外,RF模型预测结果能反映更为详细的局部土壤全氮含量空间变化信息,与实际情况更为接近。可见,RF模型可作为橡胶园土壤全氮含量空间分布预测的高效方法,为其他土壤属性的空间分布预测提供了一种新的方法。

关 键 词:土壤    降雨    数字土壤制图  区域尺度  肥力  空间分布  橡胶树
收稿时间:2014/12/23 0:00:00
修稿时间:2015/2/10 0:00:00

Prediction of soil total nitrogen for rubber plantation at regional scale based on environmental variables and random forest approach
Guo Pengtao,Li Maofen,Luo Wei,Lin Qinghuo,Tang Qunfeng and Liu Zhiwei.Prediction of soil total nitrogen for rubber plantation at regional scale based on environmental variables and random forest approach[J].Transactions of the Chinese Society of Agricultural Engineering,2015,31(5):194-202.
Authors:Guo Pengtao  Li Maofen  Luo Wei  Lin Qinghuo  Tang Qunfeng and Liu Zhiwei
Institution:1. Rubber Research Institute, Chinese Academy of Tropical Agriculture Sciences, Danzhou 571737, China;,2. Institute of Scientific and Technical Information, Chinese Academy of Tropical Agriculture Sciences, Danzhou 571737, China;,1. Rubber Research Institute, Chinese Academy of Tropical Agriculture Sciences, Danzhou 571737, China;,1. Rubber Research Institute, Chinese Academy of Tropical Agriculture Sciences, Danzhou 571737, China;,3. Hainan Agricultural Reclamation Academy of Sciences, Haikou 570206, China; and 3. Hainan Agricultural Reclamation Academy of Sciences, Haikou 570206, China;
Abstract:Abstract: Soil total nitrogen (STN) plays an important role in soil fertility and N cycle. Detailed information about the spatial distribution of STN is vital to effective management of soil fertility and better understanding of the process of N cycle. To date, however, few studies have been conducted to digitally map the spatial variation of STN for rubber (Hevea brasiliensis) plantation at the regional scale in Hainan Island, China. In this study, a relatively new method, random forest (RF) was proposed to predict and map the spatial pattern of STN for the rubber plantation. A total of 2511 topsoil (0-20 cm) samples were collected, and their STN contents were measured. Then these soil samples were randomly divided into calibration dataset (1757 soil samples) and validation dataset (754 soil samples). Fourteen environmental variables were also collected. They are parent materials, mean precipitation, mean temperature, mean normalized difference vegetation index, elevation, slope, aspect, horizontal curvature, profile curvature, relief, convergence index, relative position index, stream power index, and topographic wetness index. In this study, stepwise linear regression (SLR), generalized additive mixed model (GAMM), classification and regression tree (CART), and random forest (RF) were used to predict and map the spatial distribution of STN for the rubber plantation. In addition, GAMM and CART were also employed to uncover relationships between STN and environmental variables and further to identify the main factors influencing STN variation. The RF model was developed to predict spatial variability of STN on the basis of parent materials, mean precipitation, mean temperature, and mean normalized difference vegetation index. Performance of RF was compared with SLR, GAMM, and CART. Mean error (ME), mean absolute error (MAE), root mean squared error (RMSE), and correlation coefficient between measured STN and predicted STN were selected as comparison criteria. Results showed that RF performed much better than SLR, GAMM, and CART in predicting and mapping the spatial distribution of STN for the rubber plantation at regional scale in this study. The RF model had much higher correlation coefficient value and lower prediction errors (ME, MAE, and RMSE) than SLR, GAMM, and CART. Values of correlation coefficient, ME, MAE, and RMSE were 0.82, -0.003 g/kg, 0.088 g/kg, and 0.131 g/kg, 0.69, 0.003 g/kg, 0.121 g/kg, and 0.162 g/kg, 0.70, -0.004 g/kg, 0.120 g/kg, and 0.160 g/kg, and 0.68, -0.008 g/kg, 0.121 g/kg, 0.163 g/kg for RF, CART, GAMM, and SLR equation, respectively. Moreover, RF model yielded a more realistic spatial distribution of STN than SLR, GAMM, and CART equations. Finally, results of CART and GAMM showed that the relationships between STN and selected environmental variables (parent materials, mean precipitation, mean temperature, and mean normalized difference vegetation index) were hierarchical and non-linear in this study area. Analysis of variable importance indicated that parent materials and mean precipitation were the most important factors influencing spatial distribution of STN for rubber plantation at regional scale in this study. Overall, the good performance of RF model could be ascribed to its good capabilities of dealing with non-linear and hierarchical relationships between STN and environmental variables. These results suggested that RF is a promising approach in predicting spatial distribution of STN for rubber plantation at regional scale, and can be applied to predict other soil properties in regions with complex soil-environmental relationships.
Keywords:soils  nitrogen  precipitation  digital soil mapping  regional scale  fertility  spatial distribution  rubber tree
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《农业工程学报》浏览原始摘要信息
点击此处可从《农业工程学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号