首页> 外文会议>International Conference on Advanced Computer Science and Information Systems >Sequential forward floating selection with two selection criteria
【24h】

Sequential forward floating selection with two selection criteria

机译:用两个选择标准顺序前向浮动选择

获取原文

摘要

Feature selection has been an active area of research for decades. In 1977, Thomas M. Cover and Jan M. Van Capenhout showed that only exhaustive search can guarantee the best combination of features, but it is costly in terms of computational resources and time. This work proposes the use of two selection criteria in a stepwise search method, i.e., sequential forward floating selection algorithm which wraps support vector regression, and compares the results obtained by two of multipurpose kernels for high-dimensional linear regression problem. Adjusted R2 and mean squared error are used as optimality or selection criteria. One of the many areas which make heavy use of feature selection techniques is bioinformatics. Genome Wide Association Studies in bioinformatics aims at determining whether a genetic variant is associated with a certain phenotype. Single nucleotide polymorphism (SNP) is the most popular marker used to identify genetic polymorphisms. Testing of the proposed method for variable selection in high-dimensional linear regression was conducted using two simulated SNP datasets generated by the `scrime' package and in low-dimensional linear regression using a dataset from the `UsingR' package in R. Our results show that the intersection of the two selected subsets produced by the two selection criteria can reduce the number of false positives.
机译:特征选择是几十年的活跃的研究领域。 1977年,Thomas M. Cover和Jan M.Van Capenhout显示,只有详尽的搜索可以保证最佳功能组合,但在计算资源和时间方面昂贵。这项工作提出了在逐步搜索方法中使用两个选择标准,即汇总支持向量回归的顺序前进浮动选择算法,并将通过两个多功能内核获得的结果进行比较,用于高维线性回归问题。调整后的R 2 和均方误差用作最优性或选择标准。大量使用特征选择技术的众多领域之一是生物信息学。生物信息学的基因组宽协会研究旨在确定遗传变异是否与某种表型相关。单核苷酸多态性(SNP)是用于鉴定遗传多态性的最流行的标记物。使用由`scrime'包生成的两个模拟的SNP数据集和使用来自R.在R中的“使用R”包中的数据集中的两种模拟的SNP数据集来测试高维线性回归中的可变选择方法。由两个选择标准产生的两个所选子集的交叉点可以减少误报的数量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号