...
首页> 外文期刊>Statistics in medicine >A class comparison method with filtering-enhanced variable selection for high-dimensional data sets.
【24h】

A class comparison method with filtering-enhanced variable selection for high-dimensional data sets.

机译:用于高维数据集的具有过滤增强型变量选择的类比较方法。

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

High-throughput molecular analysis technologies can produce thousands of measurements for each of the assayed samples. A common scientific question is to identify the variables whose distributions differ between some pre-specified classes (i.e. are differentially expressed). The statistical cost of examining thousands of variables is related to the risk of identifying many variables that truly are not differentially expressed, and many different multiple testing strategies have been used for the analysis of high-dimensional data sets to control the number of these false positives. An approach that is often used in practice to reduce the multiple comparisons problem is to lessen the number of comparisons being performed by filtering out variables that are considered non-informative 'before' the analysis. However, deciding which and how many variables should be filtered out can be highly arbitrary, and different filtering strategies can result in different variables being identified as differentially expressed. We propose the filtering-enhanced variable selection (FEVS) method, a new multiple testing strategy for identifying differentially expressed variables. This method identifies differentially expressed variables by combining the results obtained using a variety of filtering methods, instead of using a pre-specified filtering method or trying to identify an optimal filtering of the variables prior to class comparison analysis. We prove that the FEVS method probabilistically controls the number of false discoveries, and we show with a set of simulations and an example from the literature that FEVS can be useful for gaining sensitivity for the detection of truly differentially expressed variables. Published in 2008 by John Wiley & Sons, Ltd.
机译:高通量分子分析技术可以为每个被分析的样品进行数千次测量。一个常见的科学问题是确定变量的分布在某些预先指定的类之间是不同的(即差异表达)。检查成千上万个变量的统计成本与确定许多真正没有差异表达的变量的风险有关,并且许多不同的多重测试策略已用于分析高维数据集以控制这些误报的数量。 。在实践中通常用于减少多重比较问题的一种方法是通过过滤掉在分析之前被认为是非信息性的变量来减少正在执行的比较次数。但是,决定应该滤除哪些变量和多少变量可能是高度任意的,并且不同的过滤策略可能会导致不同的变量被标识为差异表达。我们提出了过滤增强型变量选择(FEVS)方法,这是一种用于识别差异表达变量的新的多重测试策略。该方法通过组合使用各种过滤方法获得的结果来识别差异表达的变量,而不是使用预先指定的过滤方法或尝试在类比较分析之前尝试确定变量的最佳过滤。我们证明了FEVS方法可以概率性地控制错误发现的数量,并通过一组模拟和文献中的一个例子表明FEVS可以用于提高检测真正差异表达变量的灵敏度。 John Wiley&Sons,Ltd.于2008年出版。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号