首页> 外文学位 >Kernel methods in computer-aided constructive drug design.
【24h】

Kernel methods in computer-aided constructive drug design.

机译:计算机辅助构造药物设计​​中的内核方法。

获取原文
获取原文并翻译 | 示例

摘要

A drug is typically a small molecule that interacts with the binding site of some target protein. Drug design involves the optimization of this interaction so that the drug effectively binds with the target protein while not binding with other proteins (an event that could produce dangerous side effects). Computational drug design involves the geometric modeling of drug molecules, with the goal of generating similar molecules that will be more effective drug candidates. It is necessary that algorithms incorporate strategies to measure molecular similarity by comparing molecular descriptors that may involve dozens to hundreds of attributes. We use kernel-based methods to define these measures of similarity. Kernels are general functions that can be used to formulate similarity comparisons.;The most important aspect of these contributions is the presentation of strategies that actually generate the structure of a new drug candidate. While the training set is still used to generate a new image point in the feature space, the reverse engineering strategies just described allows us to develop a new drug candidate that is independent of issues related to probability distribution constraints placed on test set molecules.;The overall goal of this thesis is to develop effective and efficient computational methods that are reliant on transparent mathematical descriptors of molecules with applications to affinity prediction, detection of multiple binding modes, and generation of new drug leads. While in this thesis we derive computational strategies for the discovery of new drug leads, our approach differs from the traditional ligand-based approach. We have developed novel procedures to calculate inverse mappings and subsequently recover the structure of a potential drug lead. The contributions of this thesis are the following: (1) We propose a vector space model molecular descriptor (VSMMD) based on a vector space model that is suitable for kernel studies in QSAR modeling. Our experiments have provided convincing comparative empirical evidence that our descriptor formulation in conjunction with kernel based regression algorithms can provide sufficient discrimination to predict various biological activities of a molecule with reasonable accuracy. (2) We present a new component selection algorithm KACS (Kernel Alignment Component Selection) based on kernel alignment for a QSAR study. Kernel alignment has been developed as a measure of similarity between two kernel functions. In our algorithm, we refine kernel alignment as an evaluation tool, using recursive component elimination to eventually select the most important components for classification. We have demonstrated empirically and proven theoretically that our algorithm works well for finding the most important components in different QSAR data sets. (3) We extend the VSMMD in conjunction with a kernel based clustering algorithm to the prediction of multiple binding modes, a challenging area of research that has been previously studied by means of time consuming docking simulations. The results reported in this study provide strong empirical evidence that our strategy has enough resolving power to distinguish multiple binding modes through the use of a standard k-means algorithm. (4) We develop a set of reverse engineering strategies for QSAR modeling based on our VSMMD. These strategies include: (a) The use of a kernel feature space algorithm to design or modify descriptor image points in a feature space. (b) The deployment of a pre-image algorithm to map the newly defined descriptor image points in the feature space back to the input space of the descriptors. (c) The design of a probabilistic strategy to convert new descriptors to meaningful chemical graph templates.
机译:药物通常是与某些目标蛋白的结合位点相互作用的小分子。药物设计涉及优化这种相互作用,以使药物有效结合靶蛋白,而不结合其他蛋白(可能产生危险副作用的事件)。计算药物设计涉及药物分子的几何建模,目的是生成类似的分子,这些分子将成为更有效的候选药物。算法必须包含通过比较可能涉及数十到数百个属性的分子描述符来测量分子相似性的策略。我们使用基于内核的方法来定义这些相似性度量。核心是可用于制定相似性比较的一般功能。这些贡献中最重要的方面是提出实际上产生新候选药物结构的策略。虽然训练集仍用于在特征空间中生成新的图像点,但刚刚描述的逆向工程策略使我们能够开发新的候选药物,而该候选药物独立于与测试集分子上的概率分布约束有关的问题。本文的总体目标是开发依赖于分子的透明数学描述符的有效且高效的计算方法,并将其应用于亲和力预测,多种结合模式的检测以及新药线索的产生。尽管在本文中我们推导了发现新药线索的计算策略,但我们的方法不同于传统的基于配体的方法。我们已经开发出新颖的程序来计算逆映射,并随后恢复潜在药物线索的结构。本文的主要工作有以下几个方面:(1)提出了一种基于向量空间模型的向量空间模型分子描述符(VSMMD),该模型适合于QSAR建模中的核研究。我们的实验提供了令人信服的比较经验证据,即我们的描述符公式与基于核的回归算法相结合,可以提供足够的区分度,以合理的准确性预测分子的各种生物学活性。 (2)针对QSAR研究,我们提出了一种基于内核对齐的新的组件选择算法KACS(内核对齐组件选择)。已经开发了内核对齐,以度量两个内核功能之间的相似性。在我们的算法中,我们使用递归组件消除来最终选择最重要的组件进行分类,从而将内核对齐作为评估工具进行了优化。我们已经通过经验证明并从理论上证明了我们的算法可以很好地找到不同QSAR数据集中最重要的组成部分。 (3)我们结合基于内核的聚类算法将VSMMD扩展到多种绑定模式的预测上,这是一个具有挑战性的研究领域,之前已经通过耗时的对接模拟进行了研究。这项研究报告的结果提供了有力的经验证据,表明我们的策略具有足够的解决能力,可以通过使用标准的k-均值算法来区分多种结合模式。 (4)我们基于VSMMD开发了一套用于QSAR建模的逆向工程策略。这些策略包括:(a)使用内核特征空间算法来设计或修改特征空间中的描述符图像点。 (b)部署前图像算法以将特征空间中新定义的描述符图像点映射回描述符的输入空间。 (c)设计将新的描述符转换为有意义的化学图模板的概率策略。

著录项

  • 作者

    Wong, William W. L.;

  • 作者单位

    University of Waterloo (Canada).;

  • 授予单位 University of Waterloo (Canada).;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 182 p.
  • 总页数 182
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号