...
首页> 外文期刊>Chemical science >Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning
【24h】

Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning

机译:贝叶斯半监督学习用于不确定性校准的分子特性预测和主动学习

获取原文

摘要

Predicting bioactivity and physical properties of small molecules is a central challenge in drug discovery. Deep learning is becoming the method of choice but studies to date focus on mean accuracy as the main metric. However, to replace costly and mission-critical experiments by models, a high mean accuracy is not enough: outliers can derail a discovery campaign, thus models need to reliably predict when it will fail, even when the training data is biased; experiments are expensive, thus models need to be data-efficient and suggest informative training sets using active learning. We show that uncertainty quantification and active learning can be achieved by Bayesian semi-supervised graph convolutional neural networks. The Bayesian approach estimates uncertainty in a statistically principled way through sampling from the posterior distribution. Semi-supervised learning disentangles representation learning and regression, keeping uncertainty estimates accurate in the low data limit and allowing the model to start active learning from a small initial pool of training data. Our study highlights the promise of Bayesian deep learning for chemistry.
机译:预测小分子的生物活性和物理性质是药物发现中的主要挑战。深度学习正成为一种选择的方法,但迄今为止,研究一直以平均准确度为主要指标。但是,要用模型代替昂贵且关键任务的实验,平均准确性就不够了:离群值可能会使发现活动脱轨,因此模型需要可靠地预测何时会失败,即使训练数据有偏差也是如此。实验是昂贵的,因此模型需要提高数据效率,并使用主动学习来提供有益的训练集。我们表明,贝叶斯半监督图卷积神经网络可以实现不确定性量化和主动学习。贝叶斯方法通过从后验分​​布中进行抽样,以统计学上合理的方式估计不确定性。半监督学习可解开表示学习和回归问题,使不确定性估计在低数据限制内保持准确,并允许模型从少量初始训练数据开始主动学习。我们的研究突出了贝叶斯深度学习对化学的承诺。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号