Interpreting linear support vector machine models with heat map molecule coloring

Lars Rosenbaum; Georg Hinselmann; Andreas Jahn; Andreas Zell

首页> 外文期刊>Journal of Cheminformatics >Interpreting linear support vector machine models with heat map molecule coloring

【24h】

Interpreting linear support vector machine models with heat map molecule coloring

机译：用热图分子着色解释线性支持向量机模型

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Background Model-based virtual screening plays an important role in the early drug discovery stage. The outcomes of high-throughput screenings are a valuable source for machine learning algorithms to infer such models. Besides a strong performance, the interpretability of a machine learning model is a desired property to guide the optimization of a compound in later drug discovery stages. Linear support vector machines showed to have a convincing performance on large-scale data sets. The goal of this study is to present a heat map molecule coloring technique to interpret linear support vector machine models. Based on the weights of a linear model, the visualization approach colors each atom and bond of a compound according to its importance for activity. Results We evaluated our approach on a toxicity data set, a chromosome aberration data set, and the maximum unbiased validation data sets. The experiments show that our method sensibly visualizes structure-property and structure-activity relationships of a linear support vector machine model. The coloring of ligands in the binding pocket of several crystal structures of a maximum unbiased validation data set target indicates that our approach assists to determine the correct ligand orientation in the binding pocket. Additionally, the heat map coloring enables the identification of substructures important for the binding of an inhibitor. Conclusions In combination with heat map coloring, linear support vector machine models can help to guide the modification of a compound in later stages of drug discovery. Particularly substructures identified as important by our method might be a starting point for optimization of a lead compound. The heat map coloring should be considered as complementary to structure based modeling approaches. As such, it helps to get a better understanding of the binding mode of an inhibitor.

机译：背景技术基于模型的虚拟筛选在药物开发的早期阶段起着重要作用。高通量筛选的结果是机器学习算法推断此类模型的宝贵资源。除了强大的性能外，机器学习模型的可解释性也是在以后的药物发现阶段指导化合物优化的理想属性。线性支持向量机在大规模数据集上显示出令人信服的性能。这项研究的目的是提出一种热图分子着色技术来解释线性支持向量机模型。根据线性模型的权重，可视化方法根据化合物对活性的重要性为化合物的每个原子和键着色。结果我们在毒性数据集，染色体畸变数据集和最大无偏验证数据集上评估了我们的方法。实验表明，我们的方法可以直观地可视化线性支持向量机模型的结构属性和结构活动关系。最大无偏验证数据集目标的几个晶体结构的结合袋中配体的着色表明，我们的方法有助于确定结合袋中正确的配体取向。另外，热图着色使得能够鉴定对于抑制剂的结合重要的亚结构。结论结合热图着色，线性支持向量机模型可以帮助在药物发现的后期阶段指导化合物的修饰。特别是通过我们的方法确定为重要的子结构可能是优化铅化合物的起点。热图着色应被认为是对基于结构的建模方法的补充。这样，它有助于更好地理解抑制剂的结合方式。

著录项

来源
《Journal of Cheminformatics》 |2011年第s1期|共页
作者
Lars Rosenbaum; Georg Hinselmann; Andreas Jahn; Andreas Zell;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类化学;
关键词

相似文献

外文文献
中文文献
专利

1. Comparing generic and vectorial nonlinear manoeuvring models and parameter estimation using optimal truncated least square support vector machine [J] . Applied Ocean Research . 2020,第期

机译：使用最优截断最小二乘支持向量机比较通用和矢量非线性操作模型和参数估计
2. Interpreting support vector machine models for multivariate group wise analysis in neuroimaging [J] . Gaonkar Bilwaj, Shinohara Russell T., Davatzikos Christos Medical image analysis . 2015,第1期

机译：解释支持向量机模型的神经影像多元分组分析
3. Comparative Evaluation of Multiple Linear Regression and Support vector Machine aided Linear and Non-linear QSAR Models [J] . Joshi Shobha, Sharma Sonal, Yadav Mukesh Research Journal of Chemical Sciences . 2014,第7期

机译：多元线性回归和支持向量机辅助的线性和非线性QSAR模型的比较评估
4. On the complexity and interpretability of support vector machines for process modeling [C] . Pereira, C., Dourado, . 2002

机译：用于过程建模的支持向量机的复杂性和可解释性
5. Data mining via support vector machines: Scalability, applicability, and interpretability. [D] . Yu, Hwan-Jo. 2004

机译：通过支持向量机进行数据挖掘：可伸缩性，适用性和可解释性。
6. Interpreting linear support vector machine models with heat map molecule coloring [O] . Lars Rosenbaum, Georg Hinselmann, Andreas Jahn, 2011

机译：用热图分子着色解释线性支持向量机模型
7. Interpreting linear support vector machine models with heat map molecule coloring [O] . Lars Rosenbaum, Georg Hinselmann, Andreas Jahn, 2011

机译：用热图分子着色解释线性支持向量机模型

Interpreting linear support vector machine models with heat map molecule coloring

摘要

著录项

相似文献

相关主题

期刊订阅