首页> 外文期刊>Knowledge-Based Systems >A fuzzy intelligent approach to the classification problem in gene expression data analysis
【24h】

A fuzzy intelligent approach to the classification problem in gene expression data analysis

机译:基因表达数据分析中分类问题的模糊智能方法

获取原文
获取原文并翻译 | 示例
           

摘要

Classification is an important data mining task that widely used in several different real world applica tions. In microarray analysis, classification techniques are applied in order to discriminate diseases or to predict outcomes based on gene expression patterns, and perhaps even to identify the best treatment for given genetic signature. The most important challenge in gene expression data analysis lies in how to deal with its unique "high dimension small sample" characteristic, which makes many traditional clas sification techniques non-applicable or inefficient; and hence, more dedicated techniques are nowadays needed in order to approach this problem. Fuzzy logic is recently shown that is a powerful and suitable soft computing tool for handling the complex problems under incomplete data conditions. In this paper, a new hybrid model is proposed that combines artificial intelligence with fuzzy in order to benefit from unique advantages of both fuzzy logic and the classification power of the artificial neural networks (ANNs), to construct an efficient and accurate hybrid classifier in less available data situations. The pro posed model, because of using the fuzzy parameters instead of the crisp parameters, will need less data set in comparing with traditional nonfuzzy neural networks in its training process or with same training sample can better learn and hence can yield more accurate results than traditional neural networks. In addition of theoretical evidence of using fuzzy logic, empirical results of gene expression classification indicate that the proposed model exhibits effectively improved classification accuracy in comparison with traditional artificial neural networks (ANNs) and also some other well-known statistical and intel ligent classification models such as the linear discriminant analysis (LDA), the quadratic discriminant analysis (QDA), the K-nearest neighbor (KNN), and the support vector machines (SVMs). Therefore, the proposed model can be applied as an appropriate alternate approach for solving problems with scant data such as gene expression data classification, specifically when higher classification accuracy is needed.
机译:分类是一项重要的数据挖掘任务,已广泛应用于几种不同的实际应用中。在微阵列分析中,应用分类技术是为了根据基因表达模式区分疾病或预测结果,甚至可能针对给定的遗传特征识别最佳治疗方法。基因表达数据分析中最重要的挑战在于如何应对其独特的“高维小样本”特征,这使得许多传统分类技术不适用或效率低下。因此,如今需要更专用的技术来解决这个问题。最近显示了模糊逻辑,它是处理不完整数据条件下的复杂问题的强大而合适的软计算工具。本文提出了一种新的混合模型,该模型将人工智能与模糊相结合,以利用模糊逻辑和人工神经网络(ANN)的分类能力的独特优势,以更少的成本构造出高效,准确的混合分类器。可用数据情况。与传统的非模糊神经网络相比,该模型由于使用了模糊参数而不是清晰的参数,因此在训练过程中需要较少的数据集,或者使用相同的训练样本可以更好地学习,因此比传统的方法可以获得更准确的结果。神经网络。除了使用模糊逻辑的理论证据外,基因表达分类的经验结果表明,与传统的人工神经网络(ANN)以及其他一些著名的统计和智能分类模型相比,该模型具有更高的分类精度。作为线性判别分析(LDA),二次判别分析(QDA),K最近邻(KNN)和支持向量机(SVM)。因此,提出的模型可以用作解决基因表达数据分类等数据稀少问题的合适替代方法,特别是在需要更高分类精度的情况下。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号