Application of k-means clustering, linear discriminant analysis and multivariate linear regression for the development of a predictive QSAR model on 5-lipoxygenase inhibitors

Andrada Matias F.; Vega-Hissi Esteban G.; Estrada Mario R.; Garro Martinez Juan C.

Abstract

In this work, we developed a quantitative structure-activity relationship (QSAR) model for a family of 5-lipoxygenase (5-LOX) inhibitors, using k-means clustering and linear discriminant analysis (LDA) to select the training and test sets and multivariate linear regression (MLR) to select the independent variables. With k-means clustering, the full set of compounds (58 derivatives of 5-benzylidene-2-phenylthiazolinones) was divided into two clusters according to a simple discriminant function. We found that the piID molecular descriptor (conventional bond order ID number) correctly discriminates 100% of the compounds in each cluster. Thirty different models, divided into three series, were analyzed; the series with representative training and test sets (series 3) yielded the most predictive models. The statistical parameters of the best model are R-train = 0.811 and R-test = 0.801. We found that a rational selection of the training and test sets yields the most predictive models, and that random selection is sometimes unsuitable, especially when the full set of compounds can be classified into different clusters according to structural features. (C) 2015 Elsevier B.V. All rights reserved.
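The idea of building representative training and test sets from clusters can be illustrated with a minimal sketch. It is not the authors' procedure or data: the descriptor values and activities below are synthetic, the function names (`kmeans_1d`, `stratified_split`, `fit_line`, `pearson_r`) are hypothetical, the 25% test fraction is an assumption, and a single-descriptor least-squares line stands in for the multivariate regression. The LDA step is not shown.

```python
import random
import statistics


def kmeans_1d(values, k=2, iters=100):
    """Lloyd's k-means on scalar values; returns one cluster label per value."""
    ordered = sorted(values)
    # Deterministic start: spread the initial centroids across the sorted values.
    centroids = [ordered[(2 * j + 1) * len(ordered) // (2 * k)] for j in range(k)]
    labels = [0] * len(values)
    for _ in range(iters):
        new_labels = [min(range(k), key=lambda j: abs(v - centroids[j])) for v in values]
        for j in range(k):
            members = [v for v, lab in zip(values, new_labels) if lab == j]
            if members:
                centroids[j] = statistics.fmean(members)
        converged = new_labels == labels
        labels = new_labels
        if converged:
            break
    return labels


def stratified_split(labels, test_fraction=0.25, seed=0):
    """Draw the test set from every cluster so both sets represent all clusters."""
    rng = random.Random(seed)
    train, test = [], []
    for cluster in sorted(set(labels)):
        idx = [i for i, lab in enumerate(labels) if lab == cluster]
        rng.shuffle(idx)
        n_test = max(1, round(test_fraction * len(idx)))
        test += idx[:n_test]
        train += idx[n_test:]
    return sorted(train), sorted(test)


def fit_line(x, y):
    """Ordinary least squares for y = slope * x + intercept."""
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx
    return slope, my - slope * mx


def pearson_r(a, b):
    """Pearson correlation coefficient between two equal-length sequences."""
    ma, mb = statistics.fmean(a), statistics.fmean(b)
    num = sum((p - ma) * (q - mb) for p, q in zip(a, b))
    den = (sum((p - ma) ** 2 for p in a) * sum((q - mb) ** 2 for q in b)) ** 0.5
    return num / den


# Synthetic stand-in for 58 compounds: one descriptor with two structural groups.
rng = random.Random(42)
descriptor = [rng.gauss(10.0, 0.5) for _ in range(30)] + [rng.gauss(14.0, 0.5) for _ in range(28)]
activity = [0.4 * d + rng.gauss(0.0, 0.3) for d in descriptor]

labels = kmeans_1d(descriptor, k=2)
train_idx, test_idx = stratified_split(labels)

slope, intercept = fit_line([descriptor[i] for i in train_idx],
                            [activity[i] for i in train_idx])
predicted_train = [slope * descriptor[i] + intercept for i in train_idx]
predicted_test = [slope * descriptor[i] + intercept for i in test_idx]
r_train = pearson_r(predicted_train, [activity[i] for i in train_idx])
r_test = pearson_r(predicted_test, [activity[i] for i in test_idx])
print(f"train={len(train_idx)} test={len(test_idx)} R-train={r_train:.3f} R-test={r_test:.3f}")
```

Sampling the test compounds cluster by cluster keeps both structural groups present in the training and test sets, which a purely random split does not guarantee when the groups differ in size.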

Bibliographic details

Source
Chemometrics and Intelligent Laboratory Systems, 2015, 8 pages
Authors
Andrada Matias F.; Vega-Hissi Esteban G.; Estrada Mario R.; Garro Martinez Juan C.
Original format: PDF
Language: English
Chinese Library Classification: Metrology
Keywords
QSAR; 5-Lipoxygenase inhibitors; k-Means clustering; Linear discriminant analysis; Multivariate linear regression
