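Real-time recognition of main tomato organs based on a channel-wise group convolutional network
Cite this article: Zhou Yuncheng, Xu Tongyu, Deng Hanbing, Miao Teng. Real-time recognition of main tomato organs based on a channel-wise group convolutional network[J]. Transactions of the Chinese Society of Agricultural Engineering, 2018, 34(10): 153-162.
Authors: Zhou Yuncheng  Xu Tongyu  Deng Hanbing  Miao Teng
Affiliation: College of Information and Electrical Engineering, Shenyang Agricultural University
Funding: Liaoning Province Science Public Welfare Research Fund (2016004001); National Natural Science Foundation of China (31601218); Shenyang Key Science and Technology R&D Program (17-174-3-00)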
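Abstract: Real-time, accurate recognition of tomato organs is key to automated production tasks such as automatic harvesting and targeted pesticide application. This paper proposes a real-time recognition network model for the main tomato organs based on a channel-wise group convolutional network; the model predicts the boundaries and types of tomato organ objects directly from feature maps. Using statistical separability, computation speed and other criteria, combined with sample extension training, the performance of this network and several typical networks on tomato organ images was analyzed in order to select the base structure of the recognition network. A channel-wise group convolution module with a dropout layer and fully convolutional layers were then each appended to the base structure to form the overall architecture of the recognition network. The experimental results show that using the channel-wise group convolutional network as the base structure of the recognition network substantially reduces model size while significantly improving recall, recognition speed and precision. The network with this structure achieves average precision of 96.52%, 97.85% and 82.62% for flowers, fruits and stems, respectively, with recall of 77.39%, 69.33% and 64.23% and a recognition speed of 62 frames/s. Compared with YOLOv2, the recall of the proposed recognition network is 14.03 percentage points higher and its precision is 2.51 percentage points higher.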

Keywords: image recognition  algorithms  real-time recognition  tomato  convolutional neural network  channel-wise group convolution  feature extraction
Received: 2018/1/28
Revised: 2018/3/23

Real-time recognition of main organs in tomato based on channel wise group convolutional network
Zhou Yuncheng,Xu Tongyu,Deng Hanbing and Miao Teng.Real-time recognition of main organs in tomato based on channel wise group convolutional network[J].Transactions of the Chinese Society of Agricultural Engineering,2018,34(10):153-162.
Authors:Zhou Yuncheng  Xu Tongyu  Deng Hanbing and Miao Teng
Institution:College of Information and Electrical Engineering, Shenyang Agricultural University, Shenyang 110866, China
Abstract: Real-time, accurate recognition of tomato organs is crucial to automated agricultural production tasks such as automatic harvesting and targeted pesticide application. Because fruit maturity and flower age differ, and the color of tomato organs changes frequently during the growth period, it is very difficult to detect different tomato organs simultaneously with traditional image segmentation methods based on color space. Meanwhile, feature extractors such as SIFT (scale-invariant feature transform) and Haar-like features do not run in real time, so they cannot be applied directly to real-time detection of plant organs. Convolutional neural networks (CNNs) can automatically extract both low-level image features and high-level semantics, and they run in real time on GPU (graphics processing unit) devices. Therefore, inspired by Faster R-CNN and especially YOLOv2, this paper proposes a CNN-based real-time recognition method for the main organs of tomato and designs a corresponding recognition network model that predicts object boundaries and types using only the feature map. Images of tomato flowers, fruits and stems in various forms were collected in a greenhouse, and a tomato organ image data set was constructed, with the influence of illumination taken into account during acquisition. To select the base network structure for the recognition network, the performance and applicability of several typical CNN-based classification networks were analyzed using model size, statistical separability, classification performance and computation speed as criteria. Drawing on the advantages of these typical networks, a channel-wise group convolutional (CWGC) block and a corresponding classification network (CWGCNet) were designed.
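The abstract does not describe the internals of the CWGC block, so the following is only a minimal sketch of why splitting a convolution into channel groups shrinks a model. It counts the weights of a generic grouped convolution; the function name `conv_params` and the layer sizes (256 channels, 3 x 3 kernel, 8 groups) are illustrative assumptions, not values from the paper.

```python
def conv_params(c_in: int, c_out: int, k: int, groups: int = 1) -> int:
    """Weight count of a k x k convolution (bias omitted) split into `groups` channel groups.

    Each group maps c_in/groups input channels to c_out/groups output channels,
    so the total is (c_in/groups) * (c_out/groups) * k * k * groups.
    """
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    return (c_in // groups) * (c_out // groups) * k * k * groups


# Hypothetical layer sizes, chosen only for illustration.
standard = conv_params(256, 256, 3)           # ordinary convolution
grouped = conv_params(256, 256, 3, groups=8)  # same layer split into 8 groups
print(standard, grouped, standard // grouped)
```

With these assumed sizes the grouped layer needs one eighth of the weights of the ordinary layer, since the weight count scales as 1/groups.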
A sample extension training method was presented to further improve the feature extraction ability of these classification networks. CWGCNet, Darknet-19 and Inception v2 were selected as candidate base structures for the recognition network. Subsequently, a CWGC block with a dropout layer and 3 fully convolutional layers were each attached to the base structure to form the overall recognition architecture. All CNN-based classification and recognition networks were implemented in Python on the Microsoft Cognitive Toolkit (CNTK), and the experiments were run on a computer equipped with a Tesla K40c GPU. The results show that, compared with the typical CNN-based classification networks, CWGCNet combines high statistical separability of features with real-time performance. On the tomato organ image data set, sample extension training with Caltech256 significantly improves the feature extraction ability of the classification networks. Compared with an exponential function, a nonlinear scaling factor in Sigmoid form makes the recognition networks easier to train. Compared with the 3 fully convolutional layers, using the CWGC block with dropout as the additional convolution layer on the CNN base structure dramatically reduces model size while significantly improving recall, recognition speed and average precision (AP). The convolutional part of CWGCNet together with the CWGC block with dropout is used as the final structure of the recognition network. The final recognition network can identify tomato organs of different maturity and form, achieving AP of 96.52%, 97.85% and 82.62% for flower, fruit and stem, respectively. The growth stage and maturity of tomato organs have some influence on recognition accuracy; in particular, flowers in bloom, fully mature fruit and lower stems are recognized with higher accuracy.
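To make the comparison between exponential and Sigmoid-form scaling factors concrete, here is a small sketch. YOLOv2 scales a prior box dimension by exp(t); the abstract does not give the exact Sigmoid-form factor used in the paper, so `sigmoid_scale` and its `max_scale=2.0` bound are purely hypothetical stand-ins chosen so that both functions return 1 at t = 0.

```python
import math


def exp_scale(t: float) -> float:
    """YOLOv2-style scaling factor: the prior box size is multiplied by exp(t)."""
    return math.exp(t)


def sigmoid_scale(t: float, max_scale: float = 2.0) -> float:
    """Hypothetical bounded factor max_scale * sigmoid(t); always within (0, max_scale)."""
    return max_scale / (1.0 + math.exp(-t))


# Compare both factors for a few raw network outputs t.
for t in (-4.0, 0.0, 4.0, 8.0):
    print(f"t={t:+.1f}  exp={exp_scale(t):10.3f}  sigmoid-form={sigmoid_scale(t):.3f}")
```

The exponential factor grows without bound as t increases, while the bounded form saturates. Under this assumed form, a large raw output early in training cannot produce an extreme box size, which is one plausible reading of why such a factor would be easier to train.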
The final network can recall tomato organs in different forms, with recall rates of 77.39%, 69.33% and 64.23% for flower, fruit and stem, respectively. Its recognition speed is 62 fps. Compared with YOLOv2, recall is improved by 14.03 percentage points and AP by 2.51 percentage points.
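For readers unfamiliar with how a recall rate is obtained for a detector, the sketch below computes it on toy boxes. The abstract does not state the evaluation protocol, so the IoU threshold of 0.5, the greedy one-to-one matching and all box coordinates are assumptions made only for illustration.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def recall(ground_truth, detections, iou_threshold=0.5):
    """Fraction of ground-truth boxes matched by at least one unused detection."""
    used = set()
    hits = 0
    for gt in ground_truth:
        for i, det in enumerate(detections):
            if i not in used and iou(gt, det) >= iou_threshold:
                used.add(i)  # each detection may match only one ground-truth box
                hits += 1
                break
    return hits / len(ground_truth) if ground_truth else 0.0


# Toy data (assumed): three annotated organs, two of which are detected.
gt_boxes = [(0, 0, 10, 10), (20, 20, 30, 30), (50, 50, 60, 60)]
det_boxes = [(1, 1, 10, 10), (21, 19, 31, 30)]
print(f"recall = {recall(gt_boxes, det_boxes):.4f}")
```

Note that an improvement quoted in percentage points is a difference between two such rates expressed in percent, not a relative change.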
Keywords:image recognition  algorithms  real time system  tomato  convolution neural network  channel wise group convolution  feature extraction
This article is indexed in CNKI and other databases.