首页> 外文会议>International Conference on Machine Learning >Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space
【24h】

Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space

机译:平行和分布式汤普森采样,大规模加速勘探化学空间

获取原文

摘要

Chemical space is so large that brute force searches for new interesting molecules are infeasible. High-throughput virtual screening via computer cluster simulations can speed up the discovery process by collecting very large amounts of data in parallel, e.g., up to hundreds or thousands of parallel measurements. Bayesian optimization (BO) can produce additional acceleration by sequentially identifying the most useful simulations or experiments to be performed next. However, current BO methods cannot scale to the large numbers of parallel measurements and the massive libraries of molecules currently used in high-throughput screening. Here, we propose a scalable solution based on a parallel and distributed implementation of Thompson sampling (PDTS). We show that, in small scale problems, PDTS performs similarly as parallel expected improvement (EI), a batch version of the most widely used BO heuristic. Additionally, in settings where parallel EI does not scale, PDTS outperforms other scalable baselines such as a greedy search, ε-greedy approaches and a random search method. These results show that PDTS is a successful solution for large-scale parallel BO.
机译:化学空间太大,以至于对新有趣的分子进行蛮力搜索是不可行的。通过计算机群集模拟的高吞吐量虚拟筛选可以通过并行收集非常大量的数据来加速发现过程,例如,高达数百或数千个并行测量。贝叶斯优化(BO)可以通过顺序识别下次执行的最有用的模拟或实验来产生额外的加速度。然而,当前BO方法不能扩展到大量的并行测量和当前用于高通量筛选的分子的大规模文库。在这里,我们提出了一种基于汤普森采样(PDT)的平行和分布式实施方式的可扩展解决方案。我们表明,在小规模问题中,PDTS类似地作为平行预期改进(EI),批量版本最广泛使用的BO启发式。另外,在并行EI不扩展的设置中,PDTS优于其他可伸缩的基线,例如贪婪搜索,ε-贪婪的方法和随机搜索方法。这些结果表明,PDT是大规模平行博的成功解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号