首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于半监督主动学习的菊花表型分类研究
引用本文:袁培森,任守纲,翟肇裕,徐焕良.基于半监督主动学习的菊花表型分类研究[J].农业机械学报,2018,49(9):27-34.
作者姓名:袁培森  任守纲  翟肇裕  徐焕良
作者单位:南京农业大学信息科学技术学院;国家信息农业工程技术中心;马德里理工大学技术工程和电信系统高级学院
基金项目:国家自然科学基金项目(61502236)和中央高校基本科研业务费专项资金项目(KYZ201752、KJQN201651)
摘    要:鉴于人工和专家分类模式的局限性,基于表型的菊花分类存在效率低下的问题。本文采用基于半监督主动学习技术,在已分类菊花数据的基础上,利用未标号菊花样本数据提供的信息,建立了菊花表型分类模型,提升了分类质量和效率。该模型可以不依赖外界交互,利用未标号样本来自动提升菊花分类的质量。为了训练学习模型,本文收集了菊花的表型特征数据,标注了菊花表型类别,并研究了菊花分类属性特征的编码技术。在此数据集上,采用基于图标号传播的半监督学习技术对未标号的菊花数据进行建模,为了提升半监督分类的有效性,在标号传播的基础上使用主动学习技术,采用熵最大策略来选择难以识别的样本,以改进分类质量。在该数据集上进行了试验验证,并进行了试验对比和分析,试验结果表明,本文方法能够较好地利用未标号菊花样本提升分类的精度,随着标号百分比从6.25%升至23%,识别精度达到0.7以上,标号百分比在81.25%时,平均识别精度和召回率分别达到0.91和0.88。

关 键 词:菊花表型分类    半监督学习    图模型    one-hot编码    主动学习    熵最大化
收稿时间:2018/3/24 0:00:00

Chrysanthemum Phenotypic Classification Based on Semi-supervised Active Learning
YUAN Peisen,REN Shougang,ZHAI Zhaoyu and XU Huanliang.Chrysanthemum Phenotypic Classification Based on Semi-supervised Active Learning[J].Transactions of the Chinese Society of Agricultural Machinery,2018,49(9):27-34.
Authors:YUAN Peisen  REN Shougang  ZHAI Zhaoyu and XU Huanliang
Institution:Nanjing Agricultural University,Nanjing Agricultural University;National Engineering and Technology Center for Agriculture,Technical University of Madrid and Nanjing Agricultural University;National Engineering and Technology Center for Agriculture
Abstract:Phenotype-based classification plays an essential role in plant research. Chrysanthemum flower has great momentous economic value and medicinal value, and has feature of morphological and genetic diversity as well. Due to the limitations of the artificial classification model by expert and the characteristic of genetic diversity, phenotype-based classification has been facing great challenges for its research. At present, the technologies and applications of machine learning and artificial intelligence are developing rapidly. With the vehicle of machine learning, the semi-supervised learning technology was employed to provide an effective way for improving the classification performance. This method was based on label propagation of graph model as well as active learning technique. According to this method, a small number of classified chrysanthemum data as well as a large amount of unlabeled chrysanthemum samples were exploited to improve the classification accuracy. This method can automatically make use of the unlabeled samples to improve the quality of chrysanthemum classification without relying on external interactions. The chrysanthemum phenotypic data was collected to train the learning model, and manually annotate the chrysanthemum category information. For exploiting the categorical attribute, the coding skill was studied as well. The label propagation of graph model was utilized by the semi-supervised learning skill for the unlabeled chrysanthemums. In order to improve the effectiveness of semi-supervised classification, active learning technique was applied, which was based on the entropy maximization strategy to select difficult-to-identify samples to improve classification performance further. Extensive experiments were conducted and comparisons were made. The experimental results showed that the unlabeled chrysanthemum samples can improve the classification accuracy remarkably, with the labeled ratio increasing from 6.25% to 23%, the recognition accuracy rapidly reached 0.7, the average recognition accuracy and recall rate can reach 0.91 and 0.88, respectively, when the labeled ratio was 81.25%. In conclusion, semi-supervised based learning for the intelligent identification and effective management of chrysanthemum flowers had great significance in theory and application for the studying of chrysanthemum phenotype.
Keywords:chrysanthemum phenotype classification  semi supervised learning  graph model  one-hot encode  active learning  entropy maximum
本文献已被 CNKI 等数据库收录!
点击此处可从《农业机械学报》浏览原始摘要信息
点击此处可从《农业机械学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号