首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于支持向量回归的非线性变量筛选用于酚类化合物QSAR建模的研究
引用本文:徐镜善,王凯,袁哲明.基于支持向量回归的非线性变量筛选用于酚类化合物QSAR建模的研究[J].安徽农业科学,2014(13):3799-3801.
作者姓名:徐镜善  王凯  袁哲明
作者单位:湖南省植物病虫害生物学与防控重点实验室;湖南大众传媒职业技术学院;湖南省作物种质创新与资源利用国家重点实验室培育基地;
基金项目:教育部高等学校博士点专项基金(200805370002)
摘    要:首先基于支持向量回归(SVR)依均方根误差最小原则确定最优核函数,再以最优核函数为基础,进行SVR非线性自变量筛选,最后以所选自变量进行建模预测.将该方法应用于酚类化合物的QSAR研究,最优核函数确定为径向基核,最终保留自变量为疏水性参数(lgp)与拓扑指数(Am3).结果表明:基于SVR进行变量筛选能有效地剔除无关自变量,进一步改进SVR对小样本数据的建模预测能力.该方法在农业环境毒性污染物的QSAR研究领域有较广泛的应用前景.

关 键 词:支持向量回归  自变量筛选  定量构效关系  酚类化合物

Nonlinear Variable Screening Based on Support Vector Regression and Its Application on Phenol Compounds QSAR Modeling
XU Jing-shan;YUAN Zhe-ming.Nonlinear Variable Screening Based on Support Vector Regression and Its Application on Phenol Compounds QSAR Modeling[J].Journal of Anhui Agricultural Sciences,2014(13):3799-3801.
Authors:XU Jing-shan;YUAN Zhe-ming
Institution:XU Jing-shan;YUAN Zhe-ming;Hunan Provincial Key Laboratory for Biology and Control of Plant Diseases and Insect Pests;Hunan Mass Media Vocational Technical College;Hunan Provincial Key Laboratory for Germplasm Innovation and Utilization of Crop;
Abstract:Firstly,the optimal kernel function was determined in accordance with the minimum root mean square error based on support vector regression (SVR),and then on the basis of the optimal kernel function,the independent variables were screened nonlinearly using SVR.Finally,modeling was conducted on the training set and prediction was performed on the test set using the selected independent variables.The method was applied to QSAR study of phenolic compounds,the optimal kernel function was determined as RBF kernel,the retained independent variables as hydrophobic parameter (lgP) and topological index Am3.The results show that irrelevant variables can be effectively eliminate using SVR to screen variables and prediction ability was further improved for SVR modeling on small sample data,this method has a potential application prospect in the QSAR research field of environmental toxic pollutants of agriculture.
Keywords:Support vector regression  Independent variable screening  Quantitative structure-activity relationships  Phenol compounds
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号