
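Design of an Agricultural Information Focused Web Crawler
Cite this article: WANG Bin, ZHANG Yunwei, LIU Jian, CHEN Jing. Design of an Agricultural Information Focused Web Crawler[J]. Journal of Anhui Agricultural Sciences, 2009, 37(20): 9699-9700.
Authors: WANG Bin, ZHANG Yunwei, LIU Jian, CHEN Jing
Affiliation: Faculty of Modern Agricultural Engineering, Kunming University of Science and Technology, Kunming, Yunnan 650224
Abstract: When users search the web for agricultural information or related topics, general-purpose search engines return too many results, many of them only weakly relevant to the topic. To address these shortcomings, this paper proposes a design for a focused crawler targeting agricultural information and discusses in detail its crawling strategy, structural design, working principle, and implementation. Preliminary experimental results show that a focused crawler based on this design clearly outperforms an ordinary crawler in accuracy, recall, and success rate when fetching web pages on agricultural information topics.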

Keywords: Focused crawler; Search engine; Agricultural information; Topic relevance

Design of an Agricultural Information Focused Web Crawler
WANG Bin et al. Design of an Agricultural Information Focused Web Crawler[J]. Journal of Anhui Agricultural Sciences, 2009, 37(20): 9699-9700.
Authors: WANG Bin et al.
Institution: Faculty of Modern Agricultural Engineering, Kunming University of Science and Technology, Kunming, Yunnan 650224
Abstract: An agricultural information focused web crawler was designed to address the problem that, when people search for agricultural information, general search engines often return too many results that are not relevant to the topic. Its crawling strategy, structural design, working principle, and implementation are discussed in detail. Preliminary experimental results showed that a focused crawler based on this design performed clearly more accurately and efficiently than an ordinary crawler when crawling agricultural pages.
Keywords: Focused crawler; Search engine; Agricultural information; Degree of theme correlation
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.