A class comparison method with filtering-enhanced variable selection for high-dimensional data sets.

Lusa L; Korn EL; McShane LM

首页> 外文期刊>Statistics in medicine >A class comparison method with filtering-enhanced variable selection for high-dimensional data sets.

【24h】

A class comparison method with filtering-enhanced variable selection for high-dimensional data sets.

机译：用于高维数据集的具有过滤增强型变量选择的类比较方法。

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

High-throughput molecular analysis technologies can produce thousands of measurements for each of the assayed samples. A common scientific question is to identify the variables whose distributions differ between some pre-specified classes (i.e. are differentially expressed). The statistical cost of examining thousands of variables is related to the risk of identifying many variables that truly are not differentially expressed, and many different multiple testing strategies have been used for the analysis of high-dimensional data sets to control the number of these false positives. An approach that is often used in practice to reduce the multiple comparisons problem is to lessen the number of comparisons being performed by filtering out variables that are considered non-informative 'before' the analysis. However, deciding which and how many variables should be filtered out can be highly arbitrary, and different filtering strategies can result in different variables being identified as differentially expressed. We propose the filtering-enhanced variable selection (FEVS) method, a new multiple testing strategy for identifying differentially expressed variables. This method identifies differentially expressed variables by combining the results obtained using a variety of filtering methods, instead of using a pre-specified filtering method or trying to identify an optimal filtering of the variables prior to class comparison analysis. We prove that the FEVS method probabilistically controls the number of false discoveries, and we show with a set of simulations and an example from the literature that FEVS can be useful for gaining sensitivity for the detection of truly differentially expressed variables. Published in 2008 by John Wiley & Sons, Ltd.

机译：高通量分子分析技术可以为每个被分析的样品进行数千次测量。一个常见的科学问题是确定变量的分布在某些预先指定的类之间是不同的（即差异表达）。检查成千上万个变量的统计成本与确定许多真正没有差异表达的变量的风险有关，并且许多不同的多重测试策略已用于分析高维数据集以控制这些误报的数量。。在实践中通常用于减少多重比较问题的一种方法是通过过滤掉在分析之前被认为是非信息性的变量来减少正在执行的比较次数。但是，决定应该滤除哪些变量和多少变量可能是高度任意的，并且不同的过滤策略可能会导致不同的变量被标识为差异表达。我们提出了过滤增强型变量选择（FEVS）方法，这是一种用于识别差异表达变量的新的多重测试策略。该方法通过组合使用各种过滤方法获得的结果来识别差异表达的变量，而不是使用预先指定的过滤方法或尝试在类比较分析之前尝试确定变量的最佳过滤。我们证明了FEVS方法可以概率性地控制错误发现的数量，并通过一组模拟和文献中的一个例子表明FEVS可以用于提高检测真正差异表达变量的灵敏度。 John Wiley＆Sons，Ltd.于2008年出版。

著录项

来源
《Statistics in medicine》 |2008年第28期|共16页
作者
Lusa L; Korn EL; McShane LM;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类保健组织与事业（卫生事业管理）;
关键词
Analysis of substances; data set; Selection (Genetics); Testing; 选择(遗传学);

机译：Analysis of substances;data set;Selection (Genetics);Testing;选择(遗传学);

相似文献

外文文献
中文文献
专利

1. A class comparison method with filtering-enhanced variable selection for high-dimensional data sets. [J] . Lusa L, Korn EL, McShane LM Statistics in medicine . 2008,第28期

机译：用于高维数据集的具有过滤增强型变量选择的类比较方法。
2. Comparison of variable selection methods for high-dimensional survival data with competing events [J] . Julia Gilhodes, Christophe Zemmour, Soufiane Ajana, Computers in Biology and Medicine . 2017,第期

机译：竞争事件的高维生存数据变量选择方法的比较
3. New Variable Selection Method Using Interval Segmentation Purity with Application to Blockwise Kernel Transform Support Vector Machine Classification of High-Dimensional Microarray Data [J] . Tang Li-Juan, Du Wen, Fu Hai-Yan, Journal of chemical information and modeling . 2009,第8期

机译：区间分割纯度的变量选择新方法在高维微阵列数据分块核变换支持向量机分类中的应用
4. Supervised Feature Selection Method for High-Dimensional Data Classification in Photo-Thermal Infrared Imaging with Limited Training Data [C] . Nian Zhang, Keenan Leatham International Conference on Control, Decision and Information Technologies . 2018

机译：有限训练数据的光热红外成像中高维数据分类的有监督特征选择方法
5. Pre-processing methods and stepwise variable selection for binary classification of high-dimensional data. [D] . Ramachandar, Shahla. 2010

机译：高维数据二进制分类的预处理方法和逐步变量选择。
6. Comparison of Variable Selection Methods for Time-to-Event Data in High-Dimensional Settings [O] . Julia Gilhodes, Florence Dalenc, Jocelyn Gal, 2020

机译：在高维设置中对时间 - 事件时间数据的变量选择方法的比较
7. A Class Comparison Method with Filtering Enhanced Variable Selection for High-Dimensional Data Sets [O] . Lara Lusa, Edward L. Korn, Lisa M. Mcshane 2015

机译：一种用于高维数据集的滤波增强变量选择的类比较方法
8. Novel Texture-based Visualization Methods for High-dimensional Multi- field Data Sets. [R] . B. Wuensche 2013

机译：基于纹理的高维多场数据集可视化方法。

A class comparison method with filtering-enhanced variable selection for high-dimensional data sets.

摘要

著录项

相似文献

相关主题

期刊订阅