...
首页> 外文期刊>Annals Data Science >Artificial Neural Network Classification of High Dimensional Data with Novel Optimization Approach of Dimension Reduction
【24h】

Artificial Neural Network Classification of High Dimensional Data with Novel Optimization Approach of Dimension Reduction

机译:新型的降维优化方法对高维数据进行人工神经网络分类

获取原文
获取原文并翻译 | 示例
           

摘要

Classification of high dimensional data is a very crucial task in bioinformatics. Cancer classification of the microarray is a typical application of machine learning due to the large numbers of genes. Feature (genes) selection and classification with computational intelligent techniques play an important role in diagnosis and prediction of disease in the microarray. Artificial neural networks (ANN) is an artificial intelligence technique for classifying, image processing and predicting the data. This paper evaluates the performance of ANN classifier using six different hybrid feature selection techniques, for gene selection of microarray data. These hybrid techniques use Independent component analysis (ICA), as an extraction technique, popular filter techniques and bio-inspired algorithm for optimization of the ICA feature vector. Five binary gene expression microarray datasets are used to compare the performance of these techniques and determine how these techniques improve the performance of ANN classifier. These techniques can be extremely useful in feature selection because they achieve the highest classification accuracy along with the lowest average number of selected genes. Furthermore, to check the significant difference between these different algorithms a statistical hypothesis test was employed with a certain level of confidence. The experimental result shows that a combination of ICA with genetic bee colony algorithm shows superior performance as it heuristically removes non-contributing features to improve the performance of classifiers.
机译:高维数据的分类是生物信息学中非常关键的任务。由于大量的基因,微阵列的癌症分类是机器学习的典型应用。利用计算智能技术进行特征(基因)选择和分类在微阵列中疾病的诊断和预测中起着重要作用。人工神经网络(ANN)是一种用于分类,图像处理和预测数据的人工智能技术。本文使用六种不同的混合特征选择技术评估ANN分类器用于微阵列数据基因选择的性能。这些混合技术使用独立成分分析(ICA)作为提取技术,流行的过滤器技术和生物启发算法来优化ICA特征向量。使用五个二元基因表达微阵列数据集来比较这些技术的性能,并确定这些技术如何提高ANN分类器的性能。这些技术在特征选择中非常有用,因为它们实现了最高的分类精度以及最低的所选基因平均数。此外,为了检查这些不同算法之间的显着差异,采用了具有一定置信度的统计假设检验。实验结果表明,ICA与遗传蜂群算法的结合表现出优越的性能,因为它启发式地删除了非贡献性特征,从而提高了分类器的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号