首页 | 本学科首页   官方微博 | 高级检索  
     

基于双重语义空间的农业信息检索模型研究
引用本文:陈燕红,张太红,冯向萍,白涛,马健. 基于双重语义空间的农业信息检索模型研究[J]. 新疆农业大学学报, 2012, 35(3): 253-258
作者姓名:陈燕红  张太红  冯向萍  白涛  马健
作者单位:新疆农业大学计算机与信息工程学院,乌鲁木齐,830052
基金项目:新疆维吾尔自治区科技攻关项目
摘    要:为了提高针对大规模农业信息的语义检索性能,提出一种基于改进的随机索引语义空间和潜在语义空间的农业信息检索模型(IRI&LSA)。利用120万张中文网页和2 000张分为4类的小规模中文农业网页,对IRI&LSA和两种分别基于单向量兰克泽斯算法(LAS2)和半离散矩阵分解算法(SDD)的常用潜在语义检索模型(LSA-LAS2和LSA-SDD)进行了对比实验。结果表明,IRI&LSA检索结果的平均F1值可达83%,明显高于LSA-LAS2(71%)和LSA-SDD(64%);IRI&LSA的检索速度分别是LSA-LAS2和LSA-SDD的3.6倍和4.9倍。研究结果表明,IRI&LSA适合应用于较大规模农业信息检索。

关 键 词:农业信息检索  随机索引  潜在语义分析  IRI&LSA

Research on Agricultural Information Retrieval Model Based on Double Semantic Space
CHEN Yan-hong , ZHANG Tai-hong , FENG Xiang-ping , BAI Tao , MA Jian. Research on Agricultural Information Retrieval Model Based on Double Semantic Space[J]. Journal of Xinjiang Agricultural University, 2012, 35(3): 253-258
Authors:CHEN Yan-hong    ZHANG Tai-hong    FENG Xiang-ping    BAI Tao    MA Jian
Affiliation:(College of Computer and Information Engineering,Xinjiang Agricultural University,Urumqi 830052,China)
Abstract:In order to improve semantic retrieval function of massive agricultural information,an agricultural information search modle(IRI&LSA) was proposed,based on improved radom index semantic space and latent sematic space.The contrast experiments were conducted between IRI&LSA and two commonly used latent semantic models(LSA-LAS2 and LSA-SDD) by using 1.2 million Chinese web pages and 2 000 Chinese agricultural web pages that were divided into four categories,based on single-vector lanczos algorithm(LAS2) and semi-discrete matrix decomposition algorithm(SDD) respectively.These results showed that the average F1 value of search results of IRI&LSA reached 83% that was significantly higher than LSA-LAS2(71%) and LSA-SDD(64%);retrieval speed of IRI&LSA was LSA-LAS2’s 3.6 times and LSA-SDD’s 4.9 times.Experimental results showed that IRI&LSA was suitable for massive agricultural information retrieval.
Keywords:agricultural information retrieval  RI  LSA  IRI&LSA
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号