Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space


Abstract

Chemical space is so large that brute-force searches for new, interesting molecules are infeasible. High-throughput virtual screening via computer cluster simulations can speed up the discovery process by collecting very large amounts of data in parallel, e.g., up to hundreds or thousands of parallel measurements. Bayesian optimization (BO) can produce additional acceleration by sequentially identifying the most useful simulations or experiments to be performed next. However, current BO methods cannot scale to the large numbers of parallel measurements and the massive libraries of molecules currently used in high-throughput screening. Here, we propose a scalable solution based on a parallel and distributed implementation of Thompson sampling (PDTS). We show that, in small-scale problems, PDTS performs similarly to parallel expected improvement (EI), a batch version of the most widely used BO heuristic. Additionally, in settings where parallel EI does not scale, PDTS outperforms other scalable baselines such as a greedy search, ε-greedy approaches and a random search method. These results show that PDTS is a successful solution for large-scale parallel BO.
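
The batch selection idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it swaps in a Bayesian linear regression model so that posterior sampling is exact and short to write, and it runs the batch slots in a sequential loop where a distributed version would give each slot to its own worker. The function names (`thompson_batch`, `run_search`), the synthetic library, and the rule that a batch never repeats an item are all assumptions made for illustration.

```python
import numpy as np


def thompson_batch(X_lib, available, X_obs, y_obs, batch_size, rng,
                   noise_var=0.01, prior_var=1.0):
    """Pick `batch_size` distinct library indices by Thompson sampling.

    Assumption: the objective is modelled by Bayesian linear regression
    with a zero-mean Gaussian prior on the weights and Gaussian noise.
    """
    d = X_lib.shape[1]
    # Closed-form Gaussian posterior over the weights.
    precision = np.eye(d) / prior_var + X_obs.T @ X_obs / noise_var
    cov = np.linalg.inv(precision)
    cov = (cov + cov.T) / 2.0  # remove tiny asymmetries from the inverse
    mean = cov @ (X_obs.T @ y_obs) / noise_var

    available = available.copy()  # do not mutate the caller's mask
    batch = []
    for _ in range(batch_size):
        # One independent posterior sample per batch slot (per worker).
        w = rng.multivariate_normal(mean, cov)
        scores = np.where(available, X_lib @ w, -np.inf)
        idx = int(np.argmax(scores))
        batch.append(idx)
        available[idx] = False  # assumption: no duplicates within a batch
    return batch


def run_search(n_rounds=5, batch_size=10, seed=0):
    """Run a few rounds of batch Thompson sampling on a synthetic library."""
    rng = np.random.default_rng(seed)
    X_lib = rng.normal(size=(500, 5))          # hypothetical molecule features
    y_lib = X_lib @ rng.normal(size=5) + 0.1 * rng.normal(size=500)
    available = np.ones(len(X_lib), dtype=bool)
    evaluated = []
    for _ in range(n_rounds):
        idx = np.array(evaluated, dtype=int)
        batch = thompson_batch(X_lib, available, X_lib[idx], y_lib[idx],
                               batch_size, rng)
        evaluated.extend(batch)                # "measure" the whole batch
        available[batch] = False
    return evaluated, y_lib


if __name__ == "__main__":
    evaluated, y_lib = run_search()
    print("best value found:", y_lib[evaluated].max())
    print("library maximum: ", y_lib.max())
```

Because every slot draws its own sample from the posterior, the batch is diverse without any explicit coordination between slots, which is what makes this style of selection easy to parallelise. A greedy baseline would instead score every item with the posterior mean, and an ε-greedy one would replace some of those picks with random items.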


Bibliographic details

Source
International Conference on Machine Learning | 2018 | pp. 1597-2390 | 10 pages
Authors
Jose Miguel Hernandez-Lobato; James Requeima; Edward O. Pyzer-Knapp; Alan Aspuru-Guzik
Original format: PDF
Chinese Library Classification: TP181-53

Similar literature


1. Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space [J]. José Miguel Hernández-Lobato, James Requeima, Edward O. Pyzer-Knapp. JMLR: Workshop and Conference Proceedings. 2017, No. 3
2. Thompson-Howarth error analysis: unbiased alternatives to the large-sample method for assessing non-normally distributed measurement error in geochemical samples [J]. Stanley CR, Lawie D. Geochemistry: Exploration, Environment, Analysis. 2008, No. 2
3. Adaptive Landscape Flattening Accelerates Sampling of Alchemical Space in Multisite lambda Dynamics [J]. Hayes Ryan L., Armacost Kira A., Vilseck Jonah Z. The Journal of Physical Chemistry B: Condensed Matter, Materials, Surfaces, Interfaces & Biophysical. 2017, No. 15
4. Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space [C]. Jose Miguel Hernandez-Lobato, James Requeima, Edward O. Pyzer-Knapp. International Conference on Machine Learning. 2018
5. Nonlinear Encoding MRI: Multi-slice and Oblique O-space Imaging, Null Space Imaging, and Pseudo-random O-space Imaging for Accelerated Parallel Imaging [D]. Tam, Lick-Kong. 2013
6. Adaptive Landscape Flattening Accelerates Sampling of Alchemical Space in Multisite λ Dynamics [O]. Ryan L. Hayes, Kira A. Armacost, Jonah Z. Vilseck
7. Accelerating the Requirement Space Exploration through Coarse-Grained Parallel Execution [O]. Lin, Zhongwei; Yao, Yiping. 2011
8. Analytical Data for Reconnaissance Geochemical Samples from Mine Dumps, Stream Sediments, and Waters at the Thompson Creek Tungsten Mine, Custer County, Idaho [R]. Van Gosen, B. S.; Eppinger, R. G.; Hammarstrom, J. M. 2000
