An Improved Minimum Redundancy Maximum Relevance Approach for Feature Selection in Gene Expression Data

机译：基因表达数据中特征选择的提高最小冗余最大关联方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this article, an improved feature selection technique has been proposed. Mutual Information is taken as the basic criterion to find the feature relevance and redundancy. The mutual information between a feature and class labels defines the relevance of that feature. Again, the mutual information among different features defines the correlation i.e., the redundancy among those features. Now our objective is to find such a feature set for which the mutual information among the features and the class labels are maximized and the mutual information among the features are minimized. Therefore, the goal of the proposed method is to find the most relevant and least redundant feature set. The number of output features is provided by the user. First the most relevant feature is added to the empty final feature set. Then in each iteration a non-dominated feature set with respect to relevance and redundancy is generated and from this set of features, the most relevant and non-redundant feature is included in the final feature set. Thereafter, in an incremental way a feature is added in every iteration and this step is repeated while the size of the final feature set is equal to the user given number of features. The features contained by the final feature set have maximum relevance and least correlation. The proposed method is applied on microarray gene expression data to find the most relevant and non-redundant genes and the performance of the proposed method is compared with that of the popular mRMR (MIQ) and mRMR (MID) schemes on several real-life data sets.

机译：在本文中，已经提出了一种改进的特征选择技术。相互信息被视为找到特征相关性和冗余的基本标准。功能和类标签之间的互信息定义了该功能的相关性。同样，不同特征之间的互信息定义了这些特征中的冗余。现在我们的目的是找到这样的特征集，其中特征和类标签之间的互信息最大化，并且特征之间的互信息被最小化。因此，所提出的方法的目标是找到最相关和最冗余的功能集。用户提供的输出功能数。首先将最相关的功能添加到空最终功能集中。然后在每次迭代中，生成关于相关性和冗余的非主导特征集，并且从该组特征中，最相关和非冗余功能包括在最终功能集中。此后，以增量方式在每次迭代中添加特征，并且在最终特征集的大小等于用户的特征数量时重复该步骤。最终功能集中包含的特征具有最大相关性和最不相关性。所提出的方法应用于微阵列基因表达数据，以找到最相关和最冗余的基因，并将所提出的方法的性能与若干现实数据的流行MRMR（MIQ）和MRMR（MID）方案进行比较套。

著录项

来源
《International Conference on Computational Intelligence Modeling Techniques and Applications》|2014年||共8页
会议地点
作者
Monalisa Mandal; Anirban Mukhopadhyay;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类可计算性理论;
关键词
Mutual Information; Relevance; Redundancy; Non-dominated Features.;

机译：相互信息;相关性;冗余;非主导功能。;

相似文献

外文文献
中文文献
专利

1. Minimum redundancy maximum relevance feature selection approach for temporal gene expression data [J] . Milos Radovic, Mohamed Ghalwash, Nenad Filipovic, BMC Bioinformatics . 2017,第1期

机译：时间基因表达数据的最小冗余最大相关特征选择方法
2. Using covariates for improving the minimum redundancy maximum relevance feature selection method [J] . OLCAY KUR?UN, CEMAL OKAN ?AKAR, OLEG FAVOROV, Turkish Journal of Electrical Engineering and Computer Sciences . 2010,第6期

机译：使用协变量改进最小冗余最大相关特征选择方法
3. Maximum relevance minimum common redundancy feature selection for nonlinear data [J] . Jinxing Che, Youlong Yang, Li Li, Information Sciences: An International Journal . 2017,第期

机译：非线性数据的最大相关性最小常见冗余特征选择
4. An Improved Minimum Redundancy Maximum Relevance Approach for Feature Selection in Gene Expression Data [C] . Monalisa Mandal, Anirban Mukhopadhyay International Conference on Computational Intelligence Modeling Techniques and Applications . 2014

机译：基因表达数据中特征选择的提高最小冗余最大关联方法
5. New methods for variable selection with applications to survival analysis and statistical redundancy analysis using gene expression data. [D] . Hu, Simin. 2007

机译：变量选择的新方法，应用于通过基因表达数据进行的生存分析和统计冗余分析。
6. Minimum redundancy maximum relevance feature selection approach for temporal gene expression data [O] . Milos Radovic, Mohamed Ghalwash, Nenad Filipovic, 2017

机译：时间基因表达数据的最小冗余最大相关特征选择方法
7. An Improved Minimum Redundancy Maximum Relevance Approach for Feature Selection in Gene Expression Data [O] . Mandal Monalisa, Mukhopadhyay Anirban 2013

机译：基因表达数据中特征选择的改进最小冗余最大相关性方法

An Improved Minimum Redundancy Maximum Relevance Approach for Feature Selection in Gene Expression Data

摘要

著录项

相似文献

相关主题

期刊订阅