首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于卷积神经网络的田间多簇猕猴桃图像识别方法
引用本文:傅隆生,冯亚利,Elkamil Tol,刘智豪,李 瑞,崔永杰.基于卷积神经网络的田间多簇猕猴桃图像识别方法[J].农业工程学报,2018,34(2):205-211.
作者姓名:傅隆生  冯亚利  Elkamil Tol  刘智豪  李 瑞  崔永杰
作者单位:1. 西北农林科技大学机械与电子工程学院,杨凌 712100; 2. 农业部农业物联网重点实验室,杨凌 712100;,1. 西北农林科技大学机械与电子工程学院,杨凌 712100;,3. Precision Agriculture Research Chair, King Saud University, Riyadh 11451, Saudi Arabia;,1. 西北农林科技大学机械与电子工程学院,杨凌 712100;,1. 西北农林科技大学机械与电子工程学院,杨凌 712100;,1. 西北农林科技大学机械与电子工程学院,杨凌 712100; 2. 农业部农业物联网重点实验室,杨凌 712100;
基金项目:陕西省重点研发计划一般项目(2017NY-164);陕西省科技统筹创新工程计划项目(2015KTCQ02-12);国家自然科学基金资助项目(61175099);西北农林科技大学国际合作种子基金(A213021505)
摘    要:为实现田间条件下快速、准确地识别多簇猕猴桃果实,该文根据猕猴桃的棚架式栽培模式,采用竖直向上获取果实图像的拍摄方式,提出一种基于Le Net卷积神经网络的深度学习模型进行多簇猕猴桃果实图像的识别方法。该文构建的卷积神经网络通过批量归一化方法,以Re LU为激活函数,Max-pooling为下采样方法,并采用Softmax回归分类器,对卷积神经网络结构进行优化。通过对100幅田间多簇猕猴桃图像的识别,试验结果表明:该识别方法对遮挡果实、重叠果实、相邻果实和独立果实的识别率分别为78.97%、83.11%、91.01%和94.78%。通过与5种现有算法进行对比试验,该文算法相对相同环境下的识别方法提高了5.73个百分点,且识别速度达到了0.27 s/个,识别速度较其他算法速度最快。证明了该文算法对田间猕猴桃图像具有较高的识别率和实时性,表明卷积神经网络在田间果实识别方面具有良好的应用前景。

关 键 词:图像处理  图像识别  算法  深度学习  卷积神经网络  猕猴桃
收稿时间:2017/8/28 0:00:00
修稿时间:2017/12/26 0:00:00

Image recognition method of multi-cluster kiwifruit in field based on convolutional neural networks
Fu Longsheng,Feng Yali,Elkamil Tol,Liu Zhihao,Li Rui and Cui Yongjie.Image recognition method of multi-cluster kiwifruit in field based on convolutional neural networks[J].Transactions of the Chinese Society of Agricultural Engineering,2018,34(2):205-211.
Authors:Fu Longsheng  Feng Yali  Elkamil Tol  Liu Zhihao  Li Rui and Cui Yongjie
Institution:1. College of Mechanical and Electronic Engineering, Northwest A&F University, Yangling 712100, China; 2. Key Laboratory of Agricultural Internet of Things, Ministry of Agriculture, Yangling 712100, China;,1. College of Mechanical and Electronic Engineering, Northwest A&F University, Yangling 712100, China;,3. Precision Agriculture Research Chair, King Saud University, Riyadh 11451, Saudi Arabia;,1. College of Mechanical and Electronic Engineering, Northwest A&F University, Yangling 712100, China;,1. College of Mechanical and Electronic Engineering, Northwest A&F University, Yangling 712100, China; and 1. College of Mechanical and Electronic Engineering, Northwest A&F University, Yangling 712100, China; 2. Key Laboratory of Agricultural Internet of Things, Ministry of Agriculture, Yangling 712100, China;
Abstract:Abstract: China is the largest country for cultivating kiwifruit, and Shaanxi Province provides the largest production, which accounts for approximately 70% of the production in China and 33% of the global production. Harvesting kiwifruit in this region relies mainly on manual picking which is labor-intensive. Therefore, the introduction of robotic harvesting is highly desirable and suitable. The fast and effective recognition of kiwifruit in the field under natural scenes is one of the key technologies for robotic harvesting. Recently, the study on kiwifruit recognition has been limited to a single cluster and multi clusters in the field have seldom been considered. In this paper, according to growth characteristics of kiwifruit grown on sturdy support structures, an RGB (red, green, blue) camera was placed around 100 cm underneath the canopy so that kiwifruit clusters could be included in the images. We proposed a kiwifruit image recognition system based on the convolutional neural network (CNN), which has a good robustness avoiding the subjectivity and limitation of the features selection by artificial means. The CNN could be trained end to end, from raw pixels to ultimate categories, and we optimized the critical structure parameters and the training strategy. Ultimately, the network was made up of 1 input layer, 3 convolutional layers, 2 sub-sampling layers, 1 full convolutional layer, and 1 output layer. The CNN architecture was optimized by using batch normalization (BN) method, which normalized the data distribution of the middle layer and the output data, accelerating the training convergence and reducing the training time. Therefore, the BN layers were added after the 1, 3 and 5th convolutional layer (Conv1, Conv3, and Conv5 layer) of the original LeNet network. The size of all convolutional kernels was 5×5, and that of all the sub-sampling layers was 2×2. The feature map numbers of Conv1, Conv3, and Conv5 were 6, 16 and 120, respectively. After manual selection and normalizing, the RGB image of kiwifruit was transferred into a matrix with the size of 32×32 as the input of the network, stochastic gradient descent was used to train our models with mini-batch size of 100 examples, and momentum was set as 0.9. In addition, the CNN took advantages of the part connections, the weight sharing and Max pooling techniques to lower complexity and improve the training performance of the model simultaneously. The network used rectified linear units (ReLU) as activation function, which could greatly accelerate network convergence. The proposed model for training kiwifruit was represented as 32×32-6C-2S-16C-2S-120C-2. Finally, 100 images of kiwifruit in the field (including 5918 fruits) were used to test the model, and the results showed that the recognition ratios of occluded fruit, overlapped fruit, adjacent fruit and separated fruit were 78.97%, 83.11%, 91.01% and 94.78%, respectively. The overall recognition rate of the model reached 89.29%, and it only took 0.27 s in average to recognize a fruit. There was no overlap between the testing samples and the training samples, which indicated that the network had a high generalization performance, and the testing images were captured from 9 a.m. to 5 p.m., which indicated the network had a good robustness to lightness variations. However, some fruits were wrongly detected and undetected, which included the fruits occluded by branches or leaves, overlapped to each other and the ones under extremely strong sunlight. Particularly, 2 or more fruits overlapped were recognized as one fruit, which was the main reason to the success rate not very high. This phenomenon demands a further research. By comparing with the conventional methods, it suggested that the method proposed obtained a higher recognition rate and better speed, and especially it could simultaneously identify multi-cluster kiwifruit in the field, which provided significant support for multi-arm operation of harvesting robotic. It proves that the CNN has a great potential for recognition of fruits in the field.
Keywords:image processing  image recognition  algorithms  deep learning  convolutional neural network  kiwifruit
本文献已被 CNKI 等数据库收录!
点击此处可从《农业工程学报》浏览原始摘要信息
点击此处可从《农业工程学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号