首页 | 本学科首页   官方微博 | 高级检索  
     

基于改进HRNet的遥感影像冬小麦语义分割方法
引用本文:李旭青,吴冬雪,王玉博,陈文博,顾会涛. 基于改进HRNet的遥感影像冬小麦语义分割方法[J]. 农业工程学报, 2024, 40(3): 193-200
作者姓名:李旭青  吴冬雪  王玉博  陈文博  顾会涛
作者单位:北华航天工业学院遥感信息工程学院,廊坊 065000;河北省航天遥感信息处理与应用协同创新中心,廊坊 065000;北京师范大学遥感科学国家重点实验室,北京 100875;中国科学院空天信息创新研究院遥感科学国家重点实验室,北京 100101
基金项目:河北省“三三三人才工程” 资助项目(C20221032);河北省高等学校科学技术研究青年拔尖人才项目(BJ2020056);遥感科学国家重点实验室开放基金资助(OFSLRSS202303);雄安新区生态环境高分遥感监测平台应用与示范(67-Y50G04-9001-22/23);北华航天工业学院研究生创新资助项目(YKY-2022-56);北华航天工业学院研究生创新资助项目(YKY-2023-66)
摘    要:冬小麦在影像中呈现田块碎小且分布零散等空间特征,同时影像包含的复杂地物对冬小麦识别造成干扰,易出现识别精度低且边界分割模糊等问题。为及时准确获取大范围冬小麦空间分布信息,该研究以高分二号卫星影像作为数据源,提出一种CAHRNet(change attention high-resolution Net)语义分割模型。采用HRNet(high-resolution Net)替换ResNet作为模型的主干网络,网络的并行交互方式易获取高分辨率的特征信息;联合OCR(object-contextual representations)模块聚合上下文信息,以增强像素点与目标对象区域的关联性;3)引入坐标注意力(coordinate attention)机制,使网络模型充分利用有效的空间位置信息,以保留分割区域的边缘细节,提高对分布零散、形状多变的冬小麦田块的特征提取能力。试验结果表明,在自制的高分辨率遥感数据集上,CAHRNet模型的平均交并比(mean intersection over union,MIoU)和像素准确率(pixel accuracy, PA)分别达到81.72%和97.0...

关 键 词:深度学习  语义分割  遥感影像  冬小麦  智能解译
收稿时间:2023-08-16
修稿时间:2023-12-15

Improved HRNet semantic segmentation model for remote sensing images of winter wheat
LI Xuqing,WU Dongxue,WANG Yubo,CHEN Wenbo,GU Huitao. Improved HRNet semantic segmentation model for remote sensing images of winter wheat[J]. Transactions of the Chinese Society of Agricultural Engineering, 2024, 40(3): 193-200
Authors:LI Xuqing  WU Dongxue  WANG Yubo  CHEN Wenbo  GU Huitao
Affiliation:School of Remote Sensing Information Engineering, North China Institute of Aerospace Engineering, Langfang 065000, China;Aerospace Remote Sensing Information Processing and Application Collaborative Innovation Center of Hebei Province, Langfang 06500, China;State Key Laboratory of Remote Sensing Science, Beijing Normal University. Beijing 100875, China;State Key Laboratory of Remote Sensing Science, Aerospace Information Research Institute, Chinese Academy of Sciences. Beijing 100101, China
Abstract:Timely and accurate acquisition of winter wheat distribution is beneficial to the estimation of grain yield and optimization of planting structure. However, the previous means of acquisition cannot fully meet the needs of large-scale applications in recent years. Deep learning can be expected to introduce into agricultural remote sensing, in order to obtain the spatial distribution of agricultural resources, and further realize the development of agricultural modernization. Taking the winter wheat as the target object, the high-resolution Gaofen-2 remote sensing images were utilized to extract the winter wheat over a large area. Winter wheat was scattered distribution with the different field shapes in the remote sensing images. At the same time, the complex features were contained in the remote sensing images, leading to the interference in the recognition of winter wheat. In this study, a CAHRNet semantic segmentation network model was proposed to enhance the recognition accuracy and fuzzy details of field edges in the spatial distribution of winter wheat. The improved model first replaced the backbone feature network from ResNet with a high-resolution network HRNet. A unique parallel structure was connected to the multiple layers of sub-networks. The multiple sub-networks were continuously interacted with each other to repeatedly fuse the feature generated by multi-scale sub-networks. The spatial feature was enriched to rapidly locate the target object. The network was then maintained to extract the high-resolution feature. Secondly, the CAHRNet model was used to extract the spatial distribution of the winter wheat. The OCR module was introduced to aggregate the contextual feature of the target object. The extracted feature was classified by the backbone network. The aggregation of all the object regions was then evaluated to divide the regions into different categories. The correlation between the regions and the pixel points was enhanced to clarify the approximate location of the target object. Lastly, coordinate attention was introduced to capture the remote dependencies and retain the precise location of the target object, according to the dual spatial orientations. The correlation between multiple pixel points was enhanced to more accurately locate the exact position of the target object. The coordinate attention enhanced the spatial features of the target object and then reduced the detail loss caused by the model during feature extraction. Much more details were obtained than before in the prediction. The CAHRNet network model enhanced the supervision of the spatial features of the target object. The spatial relationship between the winter wheat and background pixels was established to enhance the region-pixel correlation. At the same time, the contextual feature of individual pixel points was utilized to enhance the pixel-point to pixel-point attention. The edge details of the target object region were preserved to highlight the overall features of winter wheat. The segmentation accuracy was improved to realize the fine extraction of winter wheat spatial distribution. The experimental results show that the optimal evaluation was achieved in each index on the high-resolution remote sensing dataset, with the MIoU and PA reaching 81.72% and 97.08%, respectively. The MIoU performance was improved by 9.90 and 2.44 percentage points, respectively, compared with the typical U-Net and DeepLabv3+, while the PA model was improved by 6.80 and 1.59 percentage points, respectively. The finding can provide technical support for the further acquisition of winter wheat crop distribution.
Keywords:deep learning  semantic segmentation  remote sensing image  winter wheat  intelligent interpretation
点击此处可从《农业工程学报》浏览原始摘要信息
点击此处可从《农业工程学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号