首页 | 本学科首页   官方微博 | 高级检索  
     检索      

CNN-ISS遥感影像分类的瓦片边缘效应及消除方案
引用本文:段增强,刘杰东,鹿鸣,孔祥斌,杨娜.CNN-ISS遥感影像分类的瓦片边缘效应及消除方案[J].农业工程学报,2021,37(2):209-217.
作者姓名:段增强  刘杰东  鹿鸣  孔祥斌  杨娜
作者单位:中国农业大学土地科学与技术学院 自然资源部农用地质量与监测重点实验室,北京 100193
基金项目:自然资源部国土卫星遥感应用中心,自然资源督察遥感监测指标与分析方法研究(2020072;11910661);国家社会科学基金重大项目"休养生息制度背景下的耕地保护转型研究(19ZDA096)
摘    要:应用卷积神经网络语义分割模型(Image Semantic Segmentation based on Convolutional Neural Network,CNN-ISS)进行遥感影像分类时,需将大幅影像分解为特定大小瓦片影像,并将其作为CNN-ISS处理对象,这一过程破坏了位于瓦片边缘处地物的完整几何及纹理特征,从而影响瓦片边缘处地物的识别效果,即瓦片边缘效应。该研究以DeepLab V3为CNN-ISS核心模型,对唐山农村地物进行语义分割,定量分析了分类结果的瓦片边缘效应,并提出了5个消除此效应的后处理方案。结果表明:像素分类精度与像素到瓦片边缘距离正相关,瓦片边缘处错误率最高达6.93%,中央处错误率最低为3.52%,存在瓦片边缘效应;采用该研究提出的瓦片边缘效应消除方案后,整幅影像的总精度(Pixel Accuracy,PA)、均交并比(Mean Intersection over Union,mIoU)和Kappa系数均有提升,最高分别提升0.40、1.97个百分点和0.0122。在不改变CNN-ISS核心模型条件下,通过该研究的瓦片边缘效应消除后处理方案,可有效提升遥感影像分类精度,尤其针对复杂异构体和线状地物精度提升效果更好。

关 键 词:遥感  卷积神经网络  语义分割  影像分类  瓦片边缘效应
收稿时间:2020/9/27 0:00:00
修稿时间:2020/1/13 0:00:00

Tile edge effect and elimination scheme of image classification using CNN-ISS remote sensing
Duan Zengqiang,Liu Jiedong,Lu Ming,Kong Xiangbin,Yang Na.Tile edge effect and elimination scheme of image classification using CNN-ISS remote sensing[J].Transactions of the Chinese Society of Agricultural Engineering,2021,37(2):209-217.
Authors:Duan Zengqiang  Liu Jiedong  Lu Ming  Kong Xiangbin  Yang Na
Institution:College of Land Science and Technology, China Agricultural University, Key Laboratory of Agricultural Land Quality and Monitoring, Ministry of Natural Resources, Beijing 100193, China
Abstract:Abstract: Semantic segmentation of an image has become a key interdisciplinary application in the image processing, computer vision, pattern recognition, and artificial intelligence. In deep learning architectures, the Convolutional Neural Network for Interferometric Semantic Segmentation (CNN-ISS) is widely used in digital image processing and machine vision. The CNN-ISS can be utilized to effectively extract further features, such as texture and geometric features, indicating stronger transfer learning and generalization, compared with traditional image classifications of remote sensing. As such, the CNN-ISS is suitable for the interpretation of high-resolution remote sensing image, identification of complicated features, and crop mapping. In classification, large remote sensing images need to be segmented into specific tiled images, thereby to serve as the object of Convolutional Neural Network (CNN) processing. However, an artificial image tiling can generate fragments on the edge of a tile, leading to the low classification accuracy of pixels near the edge of the tile. Here, the phenomenon was defined as the edge effect of tiled images, where the classification accuracy of pixels near the edge of the tile was lower than that of the central area. In this work, two indicators was designed, including the error rate with a distance to tile edges (ERD), and the error rate of the whole image (ERW), to quantify the edge effect of CNN-ISS processed tiled images. Meanwhile, the offset positions (i, k) were set for the starting point of the shift window to ensure that any pixel on the whole image must be in the central area of the tile generated under a certain offset setting. Then, five technical solutions were obtained to test the minimized edge effect of tiled images using the scores in multiple groups of categories. Taking the Tangshan as the segmented typical rural surface, a DeepLab V3 was selected as the core model of CNN-ISS to analyze the edge effect of the classification. The results showed that the pixel classification accuracy was positively correlated with the distance from the pixel to the edge of a tiled image. The highest error rate was 6.93% occurred along the edge of the tiled image, and the lowest error rate was 3.52% in the center of the tile, indicating the accuracy of the central area was higher than that of the edge. It showed an obvious edge effect of tiled images. In edge effect elimination scheme for the tiled images, the total classification accuracy improved significantly, where the Kappa coefficient and Mean Intersection over Union (mIoU) of the entire image increased 0.012 2 and 1.97 percent point, respectively. Taking the Kappa coefficient, one of the classic accuracy indices for the remote sensing image interpretation, as an example, the order of accuracy including the control group was: solution 2 (0.881 0)> solution 5 (0.878 9) > solution 3 (0.878 8) > solution 4 (0.877 7) > solution 1 (0.875 9) > the control group (0.868 8). Besides, the solutions of edge effects depended mainly on the types of features in the tiled images. The general law was that the tile edge effects of linear features and complex isomers (pit ponds, rural residential areas) were more obviously improved the accuracy, as the solutions were more significantly accurate, compared with that of the base land, or other agricultural land. Compared with the control group, the improvement order of IoU in the solution 2 was: roads (4.13 percent point) > pit ponds (2.97 percent point) > rivers and ditches (1.61 percent point) > rural residential areas (0.65 percent point) > other agricultural land (0.46 percent point). Without changing the core model of CNN semantic segmentation, the elimination scheme for the edge effect of a tile can be used to effectively improve the accuracy of remote sensing image classification, especially for the linear features and complex isomers.
Keywords:remote sensing  convolutional neural network  semantic segmentation  edge effect of tiled images
本文献已被 CNKI 维普 等数据库收录!
点击此处可从《农业工程学报》浏览原始摘要信息
点击此处可从《农业工程学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号