首页> 外文期刊>Artificial Intelligence Review: An International Science and Engineering Journal >An empirical evaluation of hierarchical feature selection methods for classification in bioinformatics datasets with gene ontology-based features
【24h】

An empirical evaluation of hierarchical feature selection methods for classification in bioinformatics datasets with gene ontology-based features

机译:基于基于基于基于基于基于基于基于基于基于基于基于基于基于基于基于基于基于基于基于词组的分层特征选择方法的实证评估

获取原文
获取原文并翻译 | 示例
           

摘要

Hierarchical feature selection is a new research area in machine learning/data mining, which consists of performing feature selection by exploiting dependency relationships among hierarchically structured features. This paper evaluates four hierarchical feature selection methods, i.e., HIP, MR, SHSEL and GTD, used together with four types of lazy learning-based classifiers, i.e., Na ve Bayes, Tree Augmented Na ve Bayes, Bayesian Network Augmented Na ve Bayes and k-Nearest Neighbors classifiers. These four hierarchical feature selection methods are compared with each other and with a well-known "flat" feature selection method, i.e., Correlation-based Feature Selection. The adopted bioinformatics datasets consist of aging-related genes used as instances and Gene Ontology terms used as hierarchical features. The experimental results reveal that the HIP (Select Hierarchical Information Preserving Features) method performs best overall, in terms of predictive accuracy and robustness when coping with data where the instances' classes have a substantially imbalanced distribution. This paper also reports a list of the Gene Ontology terms that were most often selected by the HIP method.
机译:分层特征选择是机器学习/数据挖掘的新研究区域,它包括通过利用分层结构化功能之间的依赖关系来执行特征选择。本文评估了四个分层特征选择方法,即HIP,MR,SHSEL和GTD,与四种类型的基于懒惰学习的分类器一起使用,即NA&贝斯,树增强Na&贝叶斯网络增强Na + ve Bayes和K-最近的邻居分类器。将这四个分层特征选择方法彼此进行比较,并且具有众所周知的“扁平”特征选择方法,即基于相关的特征选择。所采用的生物信息学数据集由用作使用作为分层特征的实例和基因本体术语的老化相关基因组成。实验结果表明,在应对实例类具有基本上不平衡的数据时,髋关节(选择分层信息保留功能)方法总体上最优异的是,在预测准确性和鲁棒性方面表现最佳。本文还报告了最常见的髋关节方法的基因本体论术语的清单。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号