...
首页> 外文期刊>Chemometrics and Intelligent Laboratory Systems >Prediction of the complexation of structurally diverse compounds with β-cyclodextrin using structural descriptors derived from electrostatic potentials on molecular surface and different chemometric methods
【24h】

Prediction of the complexation of structurally diverse compounds with β-cyclodextrin using structural descriptors derived from electrostatic potentials on molecular surface and different chemometric methods

机译:使用衍生自分子表面静电势和不同化学计量方法的结构描述符预测结构多样的化合物与β-环糊精的络合

获取原文
获取原文并翻译 | 示例
           

摘要

A quantitative structure-property relationship (QSPR) study was performed for predicting the complexation of structurally diverse compounds with β-cyclodextrin (β-CD). Six statistical methods, which include conventional multiple linear regression (MLR) and partial least-squares regression (PLS), and some up-to-date modeling techniques-support vector machine (SVM), least-squares support vector machine (LSSVM), random forest (RF) and Gaussian process (GP), were utilized to build the QSPR models. Systematical validations including internal leave-one-out cross-validation, the validation for external test set, as well as a more rigorous Monte Carlo cross-validation were also performed to confirm the reliability of the constructed models. Among these modeling methods, the GP, which can handle linear and nonlinear-hybrid relationship through a mixed covariance function, showed the best fitting and predictive abilities. The coefficient of determination r_(pred)~2 and root mean square error of prediction (RMSEP) for the external test set were 0.832 and 0.373, respectively. Physical meanings of all structural descriptors introduced, which include six quantities derived from electrostatic potential on molecular surface (ESPMS) and the energy level of highest occupied molecular orbital (E_(HOMO)), were elucidated. Some simple comparisons with previous QSPR results for the same or similar data sets were also made.
机译:进行了定量结构-性质关系(QSPR)研究,以预测结构多样的化合物与β-环糊精(β-CD)的络合。六种统计方法,包括常规的多元线性回归(MLR)和偏最小二乘回归(PLS),以及一些最新的建模技术-支持向量机(SVM),最小二乘支持向量机(LSSVM),利用随机森林(RF)和高斯过程(GP)来建立QSPR模型。还进行了系统验证,包括内部留一法交叉验证,外部测试集验证以及更严格的蒙特卡洛交叉验证,以确认所构建模型的可靠性。在这些建模方法中,可以通过混合协方差函数处理线性和非线性-混合关系的GP显示出最佳的拟合和预测能力。外部测试集的确定系数r_(pred)〜2和预测均方根误差(RMSEP)分别为0.832和0.373。阐明了所引入的所有结构描述符的物理含义,包括从分子表面静电势(ESPMS)和最高占据分子轨道的能级(E_(HOMO))得出的六个量。对于相同或相似的数据集,还与先前的QSPR结果进行了一些简单的比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号