首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于RAdam卷积神经网络的水稻生育期图像识别
引用本文:徐建鹏,王杰,徐祥,琚书存.基于RAdam卷积神经网络的水稻生育期图像识别[J].农业工程学报,2021,37(8):143-150.
作者姓名:徐建鹏  王杰  徐祥  琚书存
作者单位:1.安徽省农村综合经济信息中心,合肥 230031;2.安徽省农业生态大数据工程实验室,合肥 230031
基金项目:安徽省重大科技专项(202003A06020016);科技助力经济2020气象行业项目(KJZLJJ202002)
摘    要:为了解决现阶段水稻发育期信息的获取主要依靠人工观测的效率低、主观性强等问题,该研究提出一种基于Rectified Adam(RAdam)优化器的ResNet50卷积神经网络图像识别方法,开展水稻关键生育期的自动识别。连续2a对12块试验田的水稻物候特征进行持续自动拍摄,对采集的水稻图像进行预处理,得到水稻各发育期分类图像数据集;采用ExG因子和大津法(Otsu)算法相结合的方法对水稻图像分割,减小稻田背景干扰;对比分析了VGG16、VGG19、ResNet50和Inception v3四种模型下水稻生育期图像分级识别的性能,选取性能较优网络模型并进行了网络参数调优;对比试验了不同优化器下模型准确率和损失值的变化,选取了RAdam优化器。结果表明,采取基于RAdam优化器卷积神经网络构建的模型,在真实场景下分类识别准确率达到97.33%,网络稳定性高、收敛速度快,为水稻生育期自动化观测提供了有效方法。

关 键 词:图像识别  神经网络  模型  水稻  RAdam  ResNet50  生育期
收稿时间:2020/1/2 0:00:00
修稿时间:2021/3/10 0:00:00

Image recognition for different developmental stages of rice by RAdam deep convolutional neural networks
Xu Jianpeng,Wang Jie,Xu Xiang,Ju Shucun.Image recognition for different developmental stages of rice by RAdam deep convolutional neural networks[J].Transactions of the Chinese Society of Agricultural Engineering,2021,37(8):143-150.
Authors:Xu Jianpeng  Wang Jie  Xu Xiang  Ju Shucun
Institution:1.Anhui Rural Comprehensive Economic Information Center, Hefei 230031, China; 2.Anhui Agricultural Ecological Big Data Engineering Laboratory, Hefei 230031, China
Abstract:Abstract: An improved Convolutional Neural Network (CNN) was proposed to replace the current manual observation of the rice development period for higher efficiency and accuracy. In this study, a CNN image recognition was established with 50 layers using a risk adaptive authorization mechanism (RAdam) optimizer. Five developmental stages of rice were selected to automatically detect, including regreening, tillering, jointing, heading, and milk stage. Two cameras were assumed in 12 test fields for two consecutive years, where two pre-set points were set in each test field. Images and videos of rice were taken continuously at 8:00 and 16:00 each day. The geometric transformation of image was also used to increase the amount of input data. Finally, 35422 datasets of grading images were obtained on rice development stages. Training and test datasets were divided at the ratio of 7:3, where the original 1920x1080 pixel image was processed into 224x224 pixel size. Each image was then classified and labelled manually. A combined ExG factor with Otsu threshold was utilized to segment the rice images, to avoid the interference of some factors (water, soil, and garbage) in the rice field on the characteristics of rice development period. Strong robustness was obtained when the light and color changed, indicating high requirements of extracting the "green" characteristics of rice plant images. The parallel operation of CNN was realized by Tensor flow GPU. Four pre-trained CNN models were selected to conduct comparative experiments, including VGG16, VGG19, ResNet50, and Inception v3. The initial learning rate was set to be 0.001. The training accuracies of the VGG16, VGG19, and Inception v3 network models were 99.46%, 94.36%, and 98.70%, respectively whereas the verification accuracies were 94.76%, 89.43%, and 93.59%, respectively. The training accuracy of the ResNet50 network model was about 5% higher than that of the VGG19 network model, also higher than those of the VGG16, and Inception v3 network models. The loss value of the ResNet50 network model was also about 90% lower than those of models. Thus, it was inferred that the ResNet50 model was better suitable for the identification of key developmental stages of rice. Nevertheless, the accuracy and loss of the ResNet50 model varied greatly under the Adam and RAdam optimizers. The RAdam optimizer was about 10% faster than Adam, indicating high stability and convergence speed. Specifically, the convergence speed for Adam was 11 s per step, while that for RAdam was 12 s per step. Multiple experiments were performed on the batch size and learning rate, and further to evaluate the performance of the ResNet50 model. The training time was reduced by 7 372s, when the learning rate was set to be 0.001, and the batch size was 32. Subsequently, 5 experiments were performed on the ResNet50 network model to train the datasets of rice images during different developmental stages. The accuracies of the training and validation set were 99.53%, and 97.66%, respectively, when the training iteration reached the 18th round. Once the iterative training continued, the accuracies of the training and validation set remained stable. The constructed CNN model can be expected to recognize rice images in different developmental stages, with an average recognition accuracy of 97.33%, while high network stability and fast convergence speed. The finding can provide an effective way to automatically monitor the development stages of rice in intelligent agriculture.
Keywords:image recognition  neural networks  models  rice  RAdam  ResNet50  developmental stage
本文献已被 CNKI 等数据库收录!
点击此处可从《农业工程学报》浏览原始摘要信息
点击此处可从《农业工程学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号