...
首页> 外文期刊>Journal of Computer-Aided Molecular Design >Investigation of classification methods for the prediction of activity in diverse chemical libraries.
【24h】

Investigation of classification methods for the prediction of activity in diverse chemical libraries.

机译:研究用于预测各种化学文库中活性的分类方法。

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Classification methods based on linear discriminant analysis, recursive partitioning, and hierarchical agglomerative clustering are examined for their ability to separate active and inactive compounds in a diverse chemical database. Topology-based descriptions of chemical structure from the Molconn-X and ISIS programs are used in conjunction with these classification techniques to identify ACE inhibitors, beta-adrenergic antagonists, and H2 receptor antagonists. Overall, discriminant analysis misclassifies the smallest number of active compounds, while recursive partitioning yields the lowest rate of misclassification among inactives. Binary structural keys from the ISIS package are found to generally outperform the whole-molecule Molconn-X descriptors, especially for identification of inactive compounds. For all targets and classification methods, sensitivity toward active compounds is increased by making repetitive classification using training sets that contain equal numbers of actives and inactives. These balanced training sets provide an average numerical class membership score which may be used to select subsets of compounds that are enriched in actives.
机译:研究了基于线性判别分析,递归分区和分层聚集聚类的分类方法在各种化学数据库中分离活性和非活性化合物的能力。来自Molconn-X和ISIS程序的基于化学结构的拓扑描述与这些分类技术结合使用,以识别ACE抑制剂,β-肾上腺素能拮抗剂和H2受体拮抗剂。总体而言,判别分析对最少数量的活性化合物进行了错误分类,而递归划分在非活性物质中产生的错误分类率最低。发现ISIS程序包中的二元结构键通常胜过整个分子的Molconn-X描述子,特别是在鉴定无活性化合物时。对于所有目标和分类方法,通过使用包含相等数量的活性物质和非活性物质的训练集进行重复分类,可以提高对活性化合物的敏感性。这些平衡的训练集提供了平均数字类成员评分,可用于选择富含活性成分的化合物子集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号