...
首页> 外文期刊>Journal of Computational Chemistry: Organic, Inorganic, Physical, Biological >Message passing interface and multithreading hybrid for parallel molecular docking of large databases on petascale high performance computing machines
【24h】

Message passing interface and multithreading hybrid for parallel molecular docking of large databases on petascale high performance computing machines

机译:消息传递接口和多线程混合,用于在Petascale高性能计算机上将大型数据库并行分子对接

获取原文
获取原文并翻译 | 示例
           

摘要

A mixed parallel scheme that combines message passing interface (MPI) and multithreading was implemented in the AutoDock Vina molecular docking program. The resulting program, named VinaLC, was tested on the petascale high performance computing (HPC) machines at Lawrence Livermore National Laboratory. To exploit the typical cluster-type supercomputers, thousands of docking calculations were dispatched by the master process to run simultaneously on thousands of slave processes, where each docking calculation takes one slave process on one node, and within the node each docking calculation runs via multithreading on multiple CPU cores and shared memory. Input and output of the program and the data handling within the program were carefully designed to deal with large databases and ultimately achieve HPC on a large number of CPU cores. Parallel performance analysis of the VinaLC program shows that the code scales up to more than 15K CPUs with a very low overhead cost of 3.94%. One million flexible compound docking calculations took only 1.4 h to finish on about 15K CPUs. The docking accuracy of VinaLC has been validated against the DUD data set by the re-docking of X-ray ligands and an enrichment study, 64.4% of the top scoring poses have RMSD values under 2.0 ?. The program has been demonstrated to have good enrichment performance on 70% of the targets in the DUD data set. An analysis of the enrichment factors calculated at various percentages of the screening database indicates VinaLC has very good early recovery of actives.
机译:AutoDock Vina分子对接程序中实现了将消息传递接口(MPI)和多线程结合在一起的混合并行方案。所得的名为VinaLC的程序已在劳伦斯·利弗莫尔国家实验室的PB级高性能计算(HPC)机器上进行了测试。为了利用典型的群集型超级计算机,主进程调度了数千个对接计算,以便在数千个从属进程上同时运行,其中每个对接计算在一个节点上占用一个从属进程,并且在节点内,每个对接计算都通过多线程运行在多个CPU内核和共享内存上。程序的输入和输出以及程序中的数据处理经过精心设计,以处理大型数据库,并最终在大量CPU内核上实现HPC。对VinaLC程序的并行性能分析表明,该代码可以扩展到超过15,000个CPU,而开销成本仅为3.94%,非常低。在大约15,000个CPU上,一百万次灵活的化合物对接计算仅花费了1.4小时。通过对X射线配体的重新对接和富集研究,已针对DUD数据集验证了VinaLC的对接准确性,并进行了富集研究,其中64.4%的最高得分姿势的RMSD值均低于2.0。已经证明该程序对DUD数据集中70%的目标具有良好的浓缩性能。对在筛选数据库的各个百分比处计算出的富集因子的分析表明,VinaLC具有非常好的早期活性物质回收率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号