...
首页> 外文期刊>Computational and mathematical methods in medicine >Research on Hybrid Feature Selection Method Based on Iterative Approximation Markov Blanket
【24h】

Research on Hybrid Feature Selection Method Based on Iterative Approximation Markov Blanket

机译:基于迭代近似马尔可夫毯的混合特征选择方法研究

获取原文
           

摘要

The basic experimental data of traditional Chinese medicine are generally obtained by high-performance liquid chromatography and mass spectrometry. The data often show the characteristics of high dimensionality and few samples, and there are many irrelevant features and redundant features in the data, which bring challenges to the in-depth exploration of Chinese medicine material information. A hybrid feature selection method based on iterative approximate Markov blanket (CI_AMB) is proposed in the paper. The method uses the maximum information coefficient to measure the correlation between features and target variables and achieves the purpose of filtering irrelevant features according to the evaluation criteria, firstly. The iterative approximation Markov blanket strategy analyzes the redundancy between features and implements the elimination of redundant features and then selects an effective feature subset finally. Comparative experiments using traditional Chinese medicine material basic experimental data and UCI’s multiple public datasets show that the new method has a better advantage to select a small number of highly explanatory features, compared with Lasso, XGBoost, and the classic approximate Markov blanket method.
机译:中药的基本实验数据通常通过高效液相色谱和质谱法获得。数据经常显示高维度和少数样本的特征,数据中存在许多无关的特征和冗余功能,这对中药材料的深入探索带来了挑战。提出了一种基于迭代近似马尔可夫毯(CI_amb)的混合特征选择方法。该方法使用最大信息系数来测量特征和目标变量之间的相关性,并根据评估标准实现滤波无关的特征的目的。迭代近似马尔可夫毯策略分析了特征之间的冗余,实现了消除冗余功能,然后最后选择有效的特征子集。比较实验采用中药材料基本实验数据和UCI的多个公共数据集显示新方法具有更好的优势,可以更好地选择少数高度解释功能,与套索,XGBoost和经典的近似马尔多维毯方法相比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号