首页> 美国卫生研究院文献>Molecules >Androgen Receptor Binding Category Prediction with Deep Neural Networks and Structure- Ligand- and Statistically Based Features
【2h】

Androgen Receptor Binding Category Prediction with Deep Neural Networks and Structure- Ligand- and Statistically Based Features

机译:雄激素受体绑定类别预测深神经网络和结构 - 配体和统计基于特征

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Substances that can modify the androgen receptor pathway in humans and animals are entering the environment and food chain with the proven ability to disrupt hormonal systems and leading to toxicity and adverse effects on reproduction, brain development, and prostate cancer, among others. State-of-the-art databases with experimental data of human, chimp, and rat effects by chemicals have been used to build machine-learning classifiers and regressors and to evaluate these on independent sets. Different featurizations, algorithms, and protein structures lead to different results, with deep neural networks (DNNs) on user-defined physicochemically relevant features developed for this work outperforming graph convolutional, random forest, and large featurizations. The results show that these user-provided structure-, ligand-, and statistically based features and specific DNNs provided the best results as determined by AUC (0.87), MCC (0.47), and other metrics and by their interpretability and chemical meaning of the descriptors/features. In addition, the same features in the DNN method performed better than in a multivariate logistic model: validation MCC = 0.468 and training MCC = 0.868 for the present work compared to evaluation set MCC = 0.2036 and training set MCC = 0.5364 for the multivariate logistic regression on the full, unbalanced set. Techniques of this type may improve AR and toxicity description and prediction, improving assessment and design of compounds. Source code and data are available on github.
机译:可以改变人类和动物的雄激素受体途径的物质正在进入环境和食物链,并经过培养的能力破坏激素系统并导致对繁殖,脑发育和前列腺癌等毒性和不利影响。通过化学物质的人,黑猩猩和大鼠效应的实验数据的最先进的数据库已被用于构建机器学习分类器和回归器,并在独立集中评估这些。不同的特种术语,算法和蛋白质结构导致不同的结果,具有深度神经网络(DNN),用于为此工作开发的用户定义的物理化学相关特征,优于图形卷积,随机林和大型特种。结果表明,这些用户提供的结构 - ,配体和基于统计学的特征和特定的DNN提供了由AUC(0.87),MCC(0.47)和其他度量确定的最佳结果,以及它们的可解释性和化学意义描述符/功能。此外,与评估集MCC = 0.2036和训练MCC = 0.468和训练MCC = 0.868的验证MCC = 0.468和训练MCC = 0.2036,训练集MCC = 0.5364,验证MCC = 0.468和训练MCC = 0.868,以及用于多变量逻辑回归在完整,不平衡的集合上。这种类型的技术可以改善AR和毒性描述和预测,提高化合物的评估和设计。源代码和数据可在GitHub上获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号