首页> 外文会议>Pacific-Asia conference on knowledge discovery and data mining >A New Feature Selection and Feature Contrasting Approach Based on Quality Metric: Application to Efficient Classification of Complex Textual Data
【24h】

A New Feature Selection and Feature Contrasting Approach Based on Quality Metric: Application to Efficient Classification of Complex Textual Data

机译:基于质量指标的新特征选择和特征对比方法:应用于复杂文本数据的高效分类

获取原文

摘要

Feature maximization is a cluster quality metric which favors clusters with maximum feature representation as regard to their associated data. In this paper we go one step further showing that a straightforward adaptation of such metric can provide a highly efficient feature selection and feature contrasting model in the context of supervised classification. We more especially show that this technique can enhance the performance of classification methods whilst very significantly outperforming (+80%) the state-of-the art feature selection techniques in the case of the classification of unbalanced, highly multidimensional and noisy textual data, with a high degree of similarity between the classes.
机译:特征最大化是一种群集质量指标,其涉及具有最大特征表示的群集,视为其关联数据。在本文中,我们进一步示出了这种度量的直接适应可以在监督分类的上下文中提供高效的特征选择和特征对比模型。我们更特别地表明,这种技术可以增强分类方法的性能,同时非常显着优于(+ 80%),在不平衡,高度多维和嘈杂的文本数据的分类的情况下,最先进的特征选择技术课程之间的高度相似性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号