首页> 外文期刊>JMLR: Workshop and Conference Proceedings >Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space
【24h】

Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space

机译:大规模并行探索化学空间的并行和分布式汤普森采样

获取原文
           

摘要

Chemical space is so large that brute force searches for new interesting molecules are infeasible. High-throughput virtual screening via computer cluster simulations can speed up the discovery process by collecting very large amounts of data in parallel, e.g., up to hundreds or thousands of parallel measurements. Bayesian optimization (BO) can produce additional acceleration by sequentially identifying the most useful simulations or experiments to be performed next. However, current BO methods cannot scale to the large numbers of parallel measurements and the massive libraries of molecules currently used in high-throughput screening. Here, we propose a scalable solution based on a parallel and distributed implementation of Thompson sampling (PDTS). We show that, in small scale problems, PDTS performs similarly as parallel expected improvement (EI), a batch version of the most widely used BO heuristic. Additionally, in settings where parallel EI does not scale, PDTS outperforms other scalable baselines such as a greedy search, $epsilon$-greedy approaches and a random search method. These results show that PDTS is a successful solution for large-scale parallel BO.
机译:化学空间是如此之大,以至于无法用蛮力搜索出新的有趣分子。通过计算机集群模拟进行的高通量虚拟筛选可以通过并行收集非常大量的数据(例如,多达数百或数千个并行测量)来加快发现过程。贝叶斯优化(BO)可以通过顺序确定接下来要执行的最有用的模拟或实验来产生额外的加速度。但是,当前的BO方法无法扩展到大量并行测量以及当前在高通量筛选中使用的大量分子库。在这里,我们提出了基于汤普森采样(PDTS)的并行和分布式实现的可伸缩解决方案。我们表明,在小规模问题中,PDTS的表现与并行预期改进(EI)相似,后者是最广泛使用的BO启发式方法的批处理版本。此外,在并行EI无法缩放的设置中,PDTS的性能优于其他可扩展的基线,例如贪婪搜索,ε贪婪方法和随机搜索方法。这些结果表明,PDTS是大规模并行BO的成功解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号