首页> 外文期刊>International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms >COMBINATION OF MULTIPLE FEATURE SELECTION METHODS FOR TEXT CATEGORIZATION BY USING COMBINATORIAL FUSION ANALYSIS AND RANK-SCORE CHARACTERISTIC
【24h】

COMBINATION OF MULTIPLE FEATURE SELECTION METHODS FOR TEXT CATEGORIZATION BY USING COMBINATORIAL FUSION ANALYSIS AND RANK-SCORE CHARACTERISTIC

机译:组合融合分析和秩分特征的文本分类多特征选择方法的组合

获取原文
获取原文并翻译 | 示例
       

摘要

Effective feature selection methods are important for improving the efficiency and accuracy of text categorization algorithms by removing redundant and irrelevant terms from the corpus. Extensive research has been done to improve the performance of individual feature selection methods. However, it is always a challenge to come up with an individual feature selection method which would outperform other methods in most cases. In this paper, we explore the possibility of improving the overall performance by combining multiple individual feature selection methods. In particular, we propose a method of combining multiple feature selection methods by using an information fusion paradigm, called Combinatorial Fusion Analysis (CFA). A rank-score function and its associated graph, called rank-score graph, are adopted to measure the diversity of different feature selection methods. Our experimental results demonstrated that a combination of multiple feature selection methods can outperform a single method only if each individual feature selection method has unique scoring behavior and relatively high performance. Moreover, it is shown that the rank-score function and rank-score graph are useful for the selection of a combination of feature selection methods.
机译:有效的特征选择方法对于通过从语料库中删除多余和无关的术语来提高文本分类算法的效率和准确性非常重要。为了提高单个特征选择方法的性能,已经进行了广泛的研究。但是,提出一种单独的特征选择方法始终是一个挑战,在大多数情况下,该方法要优于其他方法。在本文中,我们探索了通过组合多种单个特征选择方法来提高整体性能的可能性。特别是,我们提出了一种通过使用信息融合范例来组合多种特征选择方法的方法,称为组合融合分析(CFA)。秩得分函数及其相关图称为秩得分图,用于度量不同特征选择方法的多样性。我们的实验结果表明,只有每种单独的特征选择方法都具有独特的评分行为和相对较高的性能,多种特征选择方法的组合才能胜过单个方法。此外,示出了等级得分函数和等级得分图对于特征选择方法的组合的选择是有用的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号