首页> 外文期刊>Bioengineered >A Gene selection approach based on the fisher linear discriminant and the neighborhood rough set
【24h】

A Gene selection approach based on the fisher linear discriminant and the neighborhood rough set

机译:基于fisher线性判别和邻域粗糙集的基因选择方法

获取原文
           

摘要

In recent years, tumor classification based on gene expression profiles has drawn great attention, and related research results have been widely applied to the clinical diagnosis of major gene diseases. These studies are of tremendous importance for accurate cancer diagnosis and subtype recognition. However, the microarray data of gene expression profiles have small samples, high dimensionality, large noise and data redundancy. To further improve the classification performance of microarray data, a gene selection approach based on the Fisher linear discriminant (FLD) and the neighborhood rough set (NRS) is proposed. First, the FLD method is employed to reduce the preliminarily genetic data to obtain features with a strong classification ability, which can form a candidate gene subset. Then, neighborhood precision and neighborhood roughness are defined in a neighborhood decision system, and the calculation approaches for neighborhood dependency and the significance of an attribute are given. A reduction model of neighborhood decision systems is presented. Thus, a gene selection algorithm based on FLD and NRS is proposed. Finally, four public gene datasets are used in the simulation experiments. Experimental results under the SVM classifier demonstrate that the proposed algorithm is effective, and it can select a smaller and more well-classified gene subset, as well as obtain better classification performance.
机译:近年来,基于基因表达谱的肿瘤分类备受关注,相关研究成果已广泛应用于重大基因疾病的临床诊断。这些研究对于准确的癌症诊断和亚型识别非常重要。然而,基因表达谱的微阵列数据具有小样本,高维度,大噪声和数据冗余。为了进一步提高微阵列数据的分类性能,提出了一种基于Fisher线性判别(FLD)和邻域粗糙集(NRS)的基因选择方法。首先,FLD方法被用来减少初步的遗传数据以获得具有强大分类能力的特征,这些特征可以形成候选基因子集。然后,在邻域决策系统中定义邻域精度和邻域粗糙度,并给出了邻域依赖性和属性重要性的计算方法。提出了邻域决策系统的约简模型。因此,提出了一种基于FLD和NRS的基因选择算法。最后,在模拟实验中使用了四个公共基因数据集。支持向量机分类器的实验结果表明,该算法是有效的,它可以选择更小,分类更好的基因子集,并获得更好的分类性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号