【24h】

Monte Carlo randomization tests for large-scale abundance datasets on the GPU.

机译:在GPU上对大规模丰度数据集进行蒙特卡洛随机测试。

获取原文
获取原文并翻译 | 示例
           

摘要

Statistical tests are often performed to discover which experimental variables are reacting to specific treatments. Time-series statistical models usually require the researcher to make assumptions with respect to the distribution of measured responses which may not hold. Randomization tests can be applied to data in order to generate null distributions non-parametrically. However, large numbers of randomizations are required for the precise p-values needed to control false discovery rates. When testing tens of thousands of variables (genes, chemical compounds, or otherwise), significant q-value cutoffs can be extremely small (on the order of 10(-5) to 10(-8)). This requires high-precision p-values, which in turn require large numbers of randomizations. The NVIDIA((R)) Compute Unified Device Architecture((R)) (CUDA((R))) platform for General Programming on the Graphics Processing Unit (GPGPU) was used to implement an application which performs high-precision randomization tests via Monte Carlo sampling for quickly screening custom test statistics for experiments with large numbers of variables, such as microarrays, Next-Generation sequencing read counts, chromatographical signals, or other abundance measurements. The software has been shown to achieve up to more than 12 fold speedup on a Graphics Processing Unit (GPU) when compared to a powerful Central Processing Unit (CPU). The main limitation is concurrent random access of shared memory on the GPU. The software is available from the authors.
机译:经常进行统计测试以发现哪些实验变量对特定处理产生了反应。时间序列统计模型通常要求研究人员对可能不成立的测量响应的分布进行假设。可以将随机测试应用于数据,以便非参数地生成零分布。但是,为控制错误发现率所需的精确p值需要大量随机化。当测试数以万计的变量(基因,化合物或其他)时,q值的临界值可能会非常小(约为10(-5)到10(-8))。这需要高精度的p值,进而需要大量的随机化。用于图形处理单元(GPGPU)通用编程的NVIDIA Compute Unified设备体系结构(CUDA(R))平台用于实现可通过以下方式执行高精度随机测试的应用程序:蒙特卡洛采样用于快速筛选定制测试统计数据,以进行具有大量变量的实验,例如微阵列,下一代测序读取计数,色谱信号或其他丰度测量。与强大的中央处理器(CPU)相比,该软件在图形处理器(GPU)上的显示速度最高可提高12倍。主要限制是GPU上对共享内存的并发随机访问。该软件可从作者处获得。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号