首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于支持向量机的中文农业文本分类技术研究
引用本文:魏芳芳,段青玲,肖晓琰,张磊.基于支持向量机的中文农业文本分类技术研究[J].农业机械学报,2015,46(S1):174-179.
作者姓名:魏芳芳  段青玲  肖晓琰  张磊
作者单位:中国农业大学,中国农业大学;北京市农业物联网工程技术研究中心,中国农业大学,中国农业大学
基金项目:国家高技术研究发展计划(863计划)资助项目(2013AA102306)、山东省自主创新资助项目(2014XGA13054)和中央高校基本科研业务费专项资金资助项目(2015XD001)
摘    要:高效地组织、分类信息,是提供个性化农业信息推荐服务的基础。根据农业文本信息特点,提出了一种基于线性支持向量机(Support vector machine,SVM)的中文农业文本分类模型,首先构建农业行业分类关键词库,通过特征词选择和权重计算,构建分类器模型,实现信息的自动分类。实验选取了1 071个测试文档,并按照种植业、林业、畜牧业、渔业进行分类。结果表明,分类准确率为96.5%,召回率为96.4%。实验结果高于贝叶斯、决策树、KNN、SMO等分类算法,将该模型应用于农业物联网行业信息综合服务平台,运行结果表明,该方法能够实现中文农业文本信息的自动分类,响应时间满足系统要求。

关 键 词:文本分类  支持向量机  中文农业信息  信息整合
收稿时间:2015/10/28 0:00:00

Classification Technique of Chinese Agricultural Text Information Based on SVM
Wei Fangfang,Duan Qingling,Xiao Xiaoyan and Zhang Lei.Classification Technique of Chinese Agricultural Text Information Based on SVM[J].Transactions of the Chinese Society of Agricultural Machinery,2015,46(S1):174-179.
Authors:Wei Fangfang  Duan Qingling  Xiao Xiaoyan and Zhang Lei
Institution:China Agricultural University,China Agricultural University;Beijing Engineering Research Center of Agricultural Internet of Things,China Agricultural University and China Agricultural University
Abstract:In order to provide personalized services for agricultural information recommendation, it was needed to organize and classify information efficiently. According to the characteristics of agricultural texts, a Chinese agricultural text classification model was proposed based on linear support vector machine (SVM). Firstly, an agriculture-domain-based dictionary was built. Secondly, a feature vector was extracted and the weight for each feature in a vector was selected. Lastly, a text classification model was established. The model was tested on 1 071 documents which were belonged to four classes: planting, forestry, animal husbandry and fisheries. The results showed that the accuracy was 96.5% and the recall rate was 96.4%. Both of their performances were higher than those of the ones using other classification methods, such as the Bayesian, decision tree, KNN, SMO algorithm and neural network. The model was applied to the platform for agricultural internet of things (IOT) industry integrated information service. The performance showed that the method can automatically classify Chinese agricultural text information and the response time met the system requirements.
Keywords:Text classification  Support vector machine  Chinese agricultural information  Information integration
点击此处可从《农业机械学报》浏览原始摘要信息
点击此处可从《农业机械学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号