首页> 美国卫生研究院文献>other >A Robust Model-Free Feature Screening Method for Ultrahigh-Dimensional Data
【2h】

A Robust Model-Free Feature Screening Method for Ultrahigh-Dimensional Data

机译:超高维数据的鲁棒无模型特征筛选方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Feature screening plays an important role in dimension reduction for ultrahigh-dimensional data. In this paper, we introduce a new feature screening method and establish its sure independence screening property under the ultrahigh-dimensional setting. The proposed method works based on the nonparanormal transformation and Henze-Zirkler’s test; that is, it first transforms the response variable and features to Gaussian random variables using the nonparanormal transformation and then tests the dependence between the response variable and features using the Henze-Zirkler’s test. The proposed method enjoys at least two merits. First, it is model-free, which avoids the specification of a particular model structure. Second, it is condition-free, which does not require any extra conditions except for some regularity conditions for high-dimensional feature screening. The numerical results indicate that, compared to the existing methods, the proposed method is more robust to the data generated from heavy-tailed distributions and/or complex models with interaction variables. The proposed method is applied to screening of anticancer drug response genes.
机译:特征筛选在超高维数据的降维中起着重要作用。在本文中,我们介绍了一种新的特征筛选方法,并建立了它在超高维设置下的确定独立性筛选属性。该方法基于非超自然变换和Henze-Zirkler检验。也就是说,它首先使用非超自然变换将响应变量和特征转换为高斯随机变量,然后使用Henze-Zirkler检验测试响应变量和特征之间的相关性。所提出的方法至少具有两个优点。首先,它是无模型的,从而避免了特定模型结构的规范。其次,它是无条件的,除了用于高维特征筛选的某些规则性条件外,不需要任何其他条件。数值结果表明,与现有方法相比,所提出的方法对于由重尾分布和/或具有交互变量的复杂模型生成的数据更健壮。该方法应用于抗癌药物反应基因的筛选。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号