首页> 美国卫生研究院文献>ACS AuthorChoice >Influence of Varying Training Set Composition andSize on Support Vector Machine-Based Prediction of Active Compounds

【2h】

Influence of Varying Training Set Composition andSize on Support Vector Machine-Based Prediction of Active Compounds

机译：训练集组成和变化的影响基于支持向量机的活性化合物预测大小

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Support vector machine (SVM) modeling is one of the most popular machine learning approaches in chemoinformatics and drug design. The influence of training set composition and size on predictions currently is an underinvestigated issue in SVM modeling. In this study, we have derived SVM classification and ranking models for a variety of compound activity classes under systematic variation of the number of positive and negative training examples. With increasing numbers of negative training compounds, SVM classification calculations became increasingly accurate and stable. However, this was only the case if a required threshold of positive training examples was also reached. In addition, consideration of class weights and optimization of cost factors substantially aided in balancing the calculations for increasing numbers of negative training examples. Taken together, the results of our analysis have practical implications for SVM learning and the prediction of active compounds. For all compound classes under study, top recall performance and independence of compound recall of training set composition wasachieved when 250–500 active and 500–1000 randomly selectedinactive training instances were used. However, as long as ∼50known active compounds were available for training, increasing numbers of 500–1000randomly selected negative training examples significantly improvedmodel performance and gave very similar results for different trainingsets.

机译：支持向量机（SVM）建模是化学信息学和药物设计中最受欢迎的机器学习方法之一。训练集的组成和大小对预测的影响目前在SVM建模中尚未得到充分研究。在这项研究中，我们在正负训练示例数量的系统变化下，推导了各种复合活动类别的SVM分类和排名模型。随着否定训练化合物数量的增加，SVM分类计算变得越来越准确和稳定。但是，只有在达到正面训练示例的要求阈值的情况下，才是这种情况。另外，考虑班级权重和成本因素的优化在很大程度上有助于平衡计算，以增加负面训练样本的数量。两者合计，我们的分析结果对SVM学习和活性化合物的预测具有实际意义。对于所有正在研究的复合课程，最佳回忆表现和训练集组成的复合回忆的独立性为250-500个活动和500-1000个随机选择时达到使用了非活动训练实例。但是，只要〜50已知的活性化合物可用于培训，数量增加了500–1000随机选择的负面训练实例有明显改善模型的性能，并针对不同的训练给出非常相似的结果套。

著录项

期刊名称 ACS AuthorChoice
作者
Raquel Rodríguez-Pérez; Martin Vogt; Jürgen Bajorath; *;
展开▼
作者单位

展开▼
年(卷),期 -1(57),4
年度 -1
页码 710–716
总页数 7
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Comparison of confirmed inactive and randomly selected compounds as negative training examples in support vector machine-based virtual screening [J] . Heikamp K., Bajorath J. Journal of chemical information and modeling . 2013,第7期

机译：在基于支持向量机的虚拟筛选中比较确认的无活性和随机选择的化合物作为阴性训练实例
2. Discriminative Weight Training for Support Vector Machine-Based Speech/Music Classification in 3GPP2 SMV Codec [J] . Sang-Kyun KIM, Joon-Hyuk CHANG IEICE Transactions on fundamentals of electronics, communications & computer sciences . 2010,第1期

机译：3GPP2 SMV编解码器中基于支持向量机的语音/音乐分类的判别权重训练
3. Locating facial landmarks by support vector machine-based active shape model [J] . Chunhua Du, Jie Yang, Qiang Wu, International Journal of Intelligent Systems Technologies and Applications . 2011,第2期

机译：通过基于支持向量机的活动形状模型定位面部标志
4. Training Set of Support Vector Regression Extracted by Empirical Mode Decomposition [C] . Han Zhong-he, Zhu Xiao-xun 2011 Asia-Pacific Power and Energy Engineering Conference . 2011

机译：经验模态分解提取支持向量回归训练集
5. Active learning with support vector machines for imbalanced datasets and a method for stopping active learning based on stabilizing predictions. [D] . Bloodgood, Michael. 2009

机译：支持向量机用于不平衡数据集的主动学习，以及一种基于稳定预测的主动学习停止方法。
6. Support Vector Machine-Based Mucin-Type O-linked Glycosylation Site Prediction Using Enhanced Sequence Feature Encoding [O] . Manabu Torii, Hongfang Liu, Zhang-Zhi Hu 2009

机译：支持向量机的增强型序列特征编码的Mucin型O联糖基化位点预测
7. Influence of Varying Training Set Composition and Size on Support Vector Machine-Based Prediction of Active Compounds [O] . Rodríguez-Pérez, Raquel, Vogt, Martin, Bajorath, Jürgen 2017

机译：训练集组成和大小的变化对基于支持向量机的活性化合物预测的影响

Influence of Varying Training Set Composition andSize on Support Vector Machine-Based Prediction of Active Compounds

摘要

著录项

相似文献

相关主题

期刊订阅