首页> 美国卫生研究院文献>other >An efficient approach for feature construction of high-dimensional microarray data by random projections
【2h】

An efficient approach for feature construction of high-dimensional microarray data by random projections

机译:通过随机投影构建高维微阵列数据特征的有效方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Dimensionality reduction of microarray data is a very challenging task due to high computational time and the large amount of memory required to train and test a model. Genetic programming (GP) is a stochastic approach to solving a problem. For high dimensional datasets, GP does not perform as well as other machine learning algorithms. To explore the inherent property of GP to generalize models from low dimensional data, we need to consider dimensionality reduction approaches. Random projections (RPs) have gained attention for reducing the dimensionality of data with reduced computational cost, compared to other dimensionality reduction approaches. We report that the features constructed from RPs perform extremely well when combined with a GP approach. We used eight datasets out of which seven have not been reported as being used in any machine learning research before. We have also compared our results by using the same full and constructed features for decision trees, random forest, naive Bayes, support vector machines and k-nearest neighbor methods.
机译:由于高计算时间以及训练和测试模型所需的大量内存,减少微阵列数据的维数是一项非常具有挑战性的任务。基因编程(GP)是一种解决问题的随机方法。对于高维数据集,GP的性能不如其他机器学习算法好。为了探索GP的固有属性以从低维数据中泛化模型,我们需要考虑降维方法。与其他降维方法相比,随机投影(RPs)已引起人们的注意,它以降低的计算成本降低了数据的维数。我们报告说,与GP方法结合使用时,由RP构造的功能性能非常好。我们使用了八个数据集,其中七个尚未被报道用于任何机器学习研究。我们还通过对决策树,随机森林,朴素贝叶斯,支持向量机和k最近邻方法使用相同的完整特征和构造特征来比较我们的结果。

著录项

  • 期刊名称 other
  • 作者单位
  • 年(卷),期 -1(13),4
  • 年度 -1
  • 页码 e0196385
  • 总页数 8
  • 原文格式 PDF
  • 正文语种
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号