
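Efficient detection model of green target fruit based on optimized Transformer network (Chinese title: 基于优化Transformer网络的绿色目标果实高效检测模型)
Cite this article: Jia Weikuan, Meng Hu, Ma Xiaohui, Zhao Yanna, Ji Ze, Zheng Yuanjie. Efficient detection model of green target fruit based on optimized Transformer network[J]. 农业工程学报 (Transactions of the Chinese Society of Agricultural Engineering), 2021, 37(14): 163-170.
Authors: Jia Weikuan (贾伟宽)  Meng Hu (孟虎)  Ma Xiaohui (马晓慧)  Zhao Yanna (赵艳娜)  Ji Ze  Zheng Yuanjie (郑元杰)
Affiliations: 1. School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China; 2. Key Laboratory of Facility Agriculture Measurement and Control Technology and Equipment of Machinery Industry, Zhenjiang 212013, China; 3. School of Engineering, Cardiff University, Cardiff CF24 3AA, UK
Funding: Natural Science Foundation of Shandong Province (ZR2020MF076, ZR2019ZD04); National Natural Science Foundation of China (62072289, 81871508); Key Research and Development Program of Shandong Province (2019GNC106115); Taishan Scholar Program of Shandong Province (TSHW201502038)
Abstract (translated from the Chinese): In orchard environments, detection of target fruit is easily affected by complex backgrounds, fruit posture, and color. To improve the accuracy and efficiency of green target fruit detection and to meet the requirements of intelligent orchard yield estimation and automated picking, this study proposes an efficient detection model for green target fruit that is suited to limited sample sizes and handles different lighting conditions and fruit postures. The model uses an optimized Transformer structure. A convolutional neural network (CNN) first extracts image features; the features are then fed into an encoder-decoder that generates a set of predicted bounding boxes for target fruit; finally, a feed-forward network (FFN) predicts the detection results. During training, resampling is introduced to expand the sample set and address the shortage of samples, and transfer learning is introduced to accelerate network convergence. Apple and persimmon datasets were built separately for model training. The experimental results show that training efficiency improved substantially after transfer learning. Compared with popular object detection models, the optimized model achieved accuracies of 93.27% and 91.35% when detecting green persimmons and green apples, respectively. The method can serve as a theoretical reference for detecting other green fruits and vegetables.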

Keywords (translated from the Chinese): object detection  green fruit  resampling  transfer learning  Transformer network
Received: 2021/5/23
Revised: 2021/7/7

Efficient detection model of green target fruit based on optimized Transformer network
Jia Weikuan,Meng Hu,Ma Xiaohui,Zhao Yanna,Ji Ze,Zheng Yuanjie.Efficient detection model of green target fruit based on optimized Transformer network[J].Transactions of the Chinese Society of Agricultural Engineering,2021,37(14):163-170.
Authors:Jia Weikuan  Meng Hu  Ma Xiaohui  Zhao Yanna  Ji Ze  Zheng Yuanjie
Institution:1.School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China; 2.Key Laboratory of Facility Agriculture Measurement and Control Technology and Equipment of Machinery Industry, Zhenjiang 212013, China; 3.School of Engineering, Cardiff University, Cardiff CF24 3AA, UK
Abstract: In complex orchard environments, the posture of target fruit varies widely, some target fruits share the color of the background, and samples are limited because data are difficult to collect under some environmental conditions. These factors make accurate detection challenging, while intelligent yield measurement and automatic harvesting demand both high accuracy and high efficiency. In this study, an efficient detection model was proposed for green target fruits, suited to small sample sizes under different light conditions and fruit postures. The model employed an optimized Transformer network. Firstly, a convolutional neural network (CNN) was used to extract image features. After feature dimension reduction and positional encoding, the features were fed into the Transformer encoder, where multi-head attention and a feed-forward network (FFN) produced the encoder output. Secondly, the Transformer decoder processed its input using multi-head attention and a feed-forward network, with positional encoding added at each stage, and generated outputs of different sizes. The set of predicted bounding boxes was much larger than the number of actual objects, which kept the miss rate for green target fruit low after decoding. Finally, an FFN was used to predict the detection results. Training a detection model generally requires sufficient samples to avoid overfitting and to improve generalization. Bootstrapping was therefore introduced to resample the original data repeatedly, and the expanded dataset met the model's need for more training samples to reach higher detection accuracy. Transfer learning was adopted to significantly improve the training efficiency of the model and to accelerate the convergence of the network.
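The bootstrapping step can be illustrated with a minimal Python sketch. It assumes that the expanded set keeps every original sample and appends draws made with replacement; the abstract does not state the expansion factor or whether the originals are retained, so `bootstrap_expand`, `factor`, `seed`, and the file names below are hypothetical and are not the authors' implementation.

```python
import random


def bootstrap_expand(samples, factor, seed=0):
    """Expand a dataset by sampling with replacement (bootstrap).

    Assumption: the originals are kept and (factor - 1) * len(samples)
    resampled items are appended. The source does not specify this.
    """
    if factor < 1:
        raise ValueError("factor must be >= 1")
    rng = random.Random(seed)  # fixed seed for reproducible draws
    n_extra = (factor - 1) * len(samples)
    extra = rng.choices(samples, k=n_extra)  # draws with replacement
    return list(samples) + extra


# Hypothetical image file names standing in for a small fruit dataset.
images = [f"persimmon_{i:03d}.jpg" for i in range(5)]
expanded = bootstrap_expand(images, factor=3)
print(len(expanded), "samples after expansion")
```

In a detection setting each resampled image would normally carry its bounding-box annotations along with it, so a resampled item is best represented as an (image, annotations) pair.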
The apple and persimmon datasets were built separately for model training. The experimental results showed that the training efficiency of the model improved by more than 13% after transfer learning. Feature transferability increased the speed and efficiency of detection as the difference between the pre-training task and the target task decreased. With transfer learning, the model converged faster and was better suited to the complex orchard environment. The new model is expected to detect green target fruit effectively in complex orchard environments with varied postures, illumination, and scenes, indicating better generalization ability and robustness. The detection accuracies were 93.27% and 91.35% for green persimmons and green apples, respectively. The optimized model thus performed best among the conventional detection models compared. The findings can also provide a theoretical reference for the detection of green fruits and vegetables in intelligent yield measurement and automated harvesting in orchards.
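The encoder step described in the abstract (positional encoding combined with image features, followed by attention) can be sketched in plain Python. This is a simplified illustration under stated assumptions, not the authors' network: it uses the standard sinusoidal positional encoding and a single attention head with no learned projections, and the toy feature values and function names are hypothetical.

```python
import math


def positional_encoding(length, dim):
    """Sinusoidal encoding: sine on even channels, cosine on odd channels."""
    table = []
    for pos in range(length):
        row = []
        for i in range(dim):
            angle = pos / (10000 ** (2 * (i // 2) / dim))
            row.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
        table.append(row)
    return table


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention for one head; returns (outputs, weights)."""
    scale = math.sqrt(len(keys[0]))
    outputs, weights = [], []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        w = softmax(scores)
        weights.append(w)
        outputs.append([sum(wj * v[d] for wj, v in zip(w, values))
                        for d in range(len(values[0]))])
    return outputs, weights


# Toy stand-in for flattened CNN features: 4 positions, 6 channels.
features = [[0.1 * (p + c) for c in range(6)] for p in range(4)]
pe = positional_encoding(4, 6)
tokens = [[f + e for f, e in zip(f_row, e_row)]
          for f_row, e_row in zip(features, pe)]
out, w = attention(tokens, tokens, tokens)  # self-attention: Q = K = V
print(len(out), "positions x", len(out[0]), "channels")
```

A multi-head layer would apply learned linear projections to obtain several query, key, and value sets, run this computation once per head, and concatenate the results before the feed-forward network.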
Keywords:object detection  green fruit  bootstrapping  transfer learning  Transformer network
This article is indexed in CNKI and other databases.