首页> 美国卫生研究院文献>Oxford Open >Weighted mining of massive collections of ... formula ...-values by convex optimization
【2h】

Weighted mining of massive collections of ... formula ...-values by convex optimization

机译:通过凸优化对... ... ...值的大量集合进行加权挖掘

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Researchers in data-rich disciplines—think of computational genomics and observational cosmology—often wish to mine large bodies of -values looking for significant effects, while controlling the false discovery rate or family-wise error rate. Increasingly, researchers also wish to prioritize certain hypotheses, for example, those thought to have larger effect sizes, by upweighting, and to impose constraints on the underlying mining, such as monotonicity along a certain sequence. We introduce Princessp, a principled method for performing weighted multiple testing by constrained convex optimization. Our method elegantly allows one to prioritize certain hypotheses through upweighting and to discount others through downweighting, while constraining the underlying weights involved in the mining process. When the -values derive from monotone likelihood ratio families such as the Gaussian means model, the new method allows exact solution of an important optimal weighting problem previously thought to be non-convex and computationally infeasible. Our method scales to massive data set sizes. We illustrate the applications of Princessp on a series of standard genomics data sets and offer comparisons with several previous ‘standard’ methods. Princessp offers both ease of operation and the ability to scale to extremely large problem sizes. The method is available as open-source software from (accessed 11 October 2017).
机译:数据丰富的学科(例如计算基因组学和观测宇宙学)的研究人员通常希望挖掘大量的值以寻找显着效果,同时控制错误发现率或家庭错误率。研究人员也越来越希望对某些假设进行优先排序,例如,通过增加权重,认为这些假设具有较大的影响大小,并对基础挖掘施加约束,例如沿特定序列的单调性。我们介绍Princessp,这是一种通过约束凸优化执行加权多重测试的原理方法。我们的方法优雅地允许一个人通过增加权重来确定某些假设的优先级,并通过减少权重来折衷其他假设,同时限制挖掘过程中涉及的基本权重。当-值从诸如高斯均值模型之类的单调似然比族派生时,该新方法可以精确解决以前认为是非凸且计算上不可行的重要最佳加权问题。我们的方法可以扩展到海量数据集的大小。我们将说明Princessp在一系列标准基因组学数据集上的应用,并与几种以前的“标准”方法进行比较。 Princessp既易于操作,又可以扩展到非常大的问题大小。该方法可从(2017年10月11日访问)作为开源软件获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号