Additive SMILES-Based Carcinogenicity Models: Probabilistic Principles in the Search for Robust Predictions

机译：基于SMILES的附加致癌性模型：寻找稳健预测的概率原理

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Optimal descriptors calculated with the simplified molecular input line entry system (SMILES) have been utilized in modeling of carcinogenicity as continuous values (logTD50). These descriptors can be calculated using correlation weights of SMILES attributes calculated by the Monte Carlo method. A considerable subset of these attributes includes rare attributes. The use of these rare attributes can lead to overtraining. One can avoid the influence of the rare attributes if their correlation weights are fixed to zero. A function, limS, has been defined to identify rare attributes. The limS defines the minimum number of occurrences in the set of structures of the training (subtraining) set, to accept attributes as usable. If an attribute is present less than limS, it is considered “rare”, and thus not used. Two systems of building up models were examined: 1. classic training-test system; 2. balance of correlations for the subtraining and calibration sets (together, they are the original training set: the function of the calibration set is imitation of a preliminary test set). Three random splits into subtraining, calibration, and test sets were analysed. Comparison of abovementioned systems has shown that balance of correlations gives more robust prediction of the carcinogenicity for all three splits (split 1: rtest²=0.7514, stest=0.684; split 2: rtest²=0.7998, stest=0.600; split 3: rtest²=0.7192, stest=0.728).

机译：通过简化的分子输入线输入系统（SMILES）计算出的最佳描述子已在致癌性建模中用作连续值（logTD50）。可以使用通过蒙特卡洛方法计算的SMILES属性的相关权重来计算这些描述符。这些属性的相当一部分包括稀有属性。这些稀有属性的使用可能导致过度训练。如果将它们的相关权重固定为零，则可以避免稀有属性的影响。已定义函数limS来识别稀有属性。 limS定义训练（子训练）集合的结构集合中出现的最小次数，以接受可用属性。如果存在的属性小于limS，则将其视为“稀有”，因此不使用。研究了两种建立模型的系统：1.经典的训练测试系统； 2.子训练集和校准集的相关性的平衡（它们都是原始训练集：校准集的功能是模仿初步测试集）。分析了三个随机拆分，分别是子训练，校准和测试集。上述系统的比较表明，相关性的平衡给出了所有三个分割的致癌性的更可靠的预测（分割1：rtest ^{2 = 0.7514，stest = 0.684；分割2：rtest ^{2 < /sup>=0.7998，stest=0.600；拆分3：rtest ^{2 = 0.7192，stest = 0.728）。}}}

著录项

期刊名称 International Journal of Molecular Sciences
作者
Andrey A. Toropov; Alla P. Toropova; Emilio Benfenati;
展开▼
作者单位

展开▼
年(卷),期 2009(10),7
年度 2009
页码 3106–3127
总页数 22
原文格式 PDF
正文语种
中图分类分子生物学;
关键词
QSAR SMILES optimal descriptor carcinogenicity balance of correlations applicability domain;

机译：QSAR;SMILES;最佳描述符;致癌性;相关性平衡;适用范围;

相似文献

外文文献
中文文献
专利

1. Additive SMILES-Based Carcinogenicity Models: Probabilistic Principles in the Search for Robust Predictions [J] . Alla P. Toropova, Andrey A. Toropov, Emilio Benfenati International Journal of Molecular Sciences . 2009,第7期

机译：基于SMILES的附加致癌性模型：寻找稳健预测的概率原理
2. Additive models and robust aggregation for GEFCom2014 probabilistic electric load and electricity price forecasting [J] . Gaillard Pierre, Goude Yannig, Nedellec Raphael International journal of forecasting . 2016,第3期

机译：GEFCom2014概率电力负荷和电价预测的附加模型和鲁棒聚合
3. Probabilistic hierarchical Bayesian framework for time-domain model updating and robust predictions [J] . Sedehi Omid, Papadimitriou Costas, Katafygiotis Lambros S. Mechanical systems and signal processing . 2019,第MAYa15期

机译：时域模型更新和鲁棒预测的概率分层贝叶斯框架
4. A Binary Probabilistic Model and Genetic Algorithm for HIV Protease Cleavage Sites Prediction and Search [C] . Zheng Rong Yang 第8届国际神经信息处理大会 . 2001

机译：HIV蛋白酶裂解位点的二进制概率模型和遗传算法。
5. An Integrated Model for the Probabilistic Prediction of Yield Strength in Electron-Beam Additively Manufactured Ti-6Al-4V [D] . Ales, Thomas K. 2018

机译：电子束增材制造的Ti-6Al-4V屈服强度概率预测的集成模型
6. Can Population Modelling Principles be Used to Identify Key PBPK Parameters for Paediatric Clearance Predictions? An Innovative Application of Optimal Design Theory [O] . Elisa A. M. Calvier, Thu Thuy Nguyen, Trevor N. Johnson, -1

机译：可以使用总体建模原理来确定用于儿科清除率预测的关键PBPK参数吗？最优设计理论的创新应用
7. Additive SMILES-Based Carcinogenicity Models: Probabilistic Principles in the Search for Robust Predictions [O] . Toropov, Andrey A., Toropova, Alla P., Benfenati, Emilio 2009

机译：基于SMILES的附加致癌性模型：寻找稳健预测的概率原理

Additive SMILES-Based Carcinogenicity Models: Probabilistic Principles in the Search for Robust Predictions

摘要

著录项

相似文献

相关主题

期刊订阅