首页> 外文会议>International work-conference on bioinformatics and biomedical engineering >Simultaneous Gene Selection and Weighting in Nearest Neighbor Classifier for Gene Expression Data
【24h】

Simultaneous Gene Selection and Weighting in Nearest Neighbor Classifier for Gene Expression Data

机译:最近邻分类器中同时进行基因选择和加权的基因表达数据

获取原文

摘要

Gene expression data, such as microarray data, plays an important role as a biomarker in order to help in the effective cancer diagnosis, tumor classification or drug design at molecular level. However, due to high-dimensionality of microarray datasets, they tend to have irrelevant or redundant features, and may lead to poor classification performance. For this reason, feature selection methods are commonly used to reduce the amount of data and to select relevant genes to improve the accuracy of machine learning methods. In this paper, a simultaneous feature ranking and weighting gene selection method with a nearest neighbor-based classifier is presented. In order to demonstrate the effectiveness of this proposal, a range of experiments over four well-known microarray datasets were carried out. Results showed that our method outperforms previous methods in terms of classification accuracy. Furthermore, evidence for significance of our results by means of non-parametric Friedman test is provided.
机译:基因表达数据(例如微阵列数据)作为生物标记物起着重要作用,以帮助在分子水平上进行有效的癌症诊断,肿瘤分类或药物设计。然而,由于微阵列数据集的高维性,它们倾向于具有不相关或多余的特征,并可能导致较差的分类性能。因此,特征选择方法通常用于减少数据量并选择相关基因以提高机器学习方法的准确性。本文提出了一种基于最近邻分类器的同时特征排序和加权基因选择方法。为了证明该建议的有效性,对四个著名的微阵列数据集进行了一系列实验。结果表明,在分类准确度方面,我们的方法优于以前的方法。此外,还提供了通过非参数弗里德曼检验证明我们的结果具有重要意义的证据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号