首页> 外文会议>Trends and applications in knowledge discovery and data mining >A New Feature Selection and Feature Contrasting Approach Based on Quality Metric: Application to Efficient Classification of Complex Textual Data
【24h】

A New Feature Selection and Feature Contrasting Approach Based on Quality Metric: Application to Efficient Classification of Complex Textual Data

机译:基于质量指标的特征选择与特征对比新方法:在复杂文本数据有效分类中的应用

获取原文
获取原文并翻译 | 示例

摘要

Feature maximization is a cluster quality metric which favors clusters with maximum feature representation as regard to their associated data. In this paper we go one step further showing that a straightforward adaptation of such metric can provide a highly efficient feature selection and feature contrasting model in the context of supervised classification. We more especially show that this technique can enhance the performance of classification methods whilst very significantly outperforming (+80%) the state-of-the art feature selection techniques in the case of the classification of unbalanced, highly multidimensional and noisy textual data, with a high degree of similarity between the classes.
机译:特征最大化是一种集群质量度量,它偏向于具有最大特征表示的集群及其关联数据。在本文中,我们进一步迈出了一步,表明在监督分类的情况下,这种度量的直接适应可以提供高效的特征选择和特征对比模型。我们更特别地表明,在对不平衡,高度多维和嘈杂的文本数据进行分类的情况下,该技术可以大大提高分类方法的性能,同时显着优于(+ 80%)最新的特征选择技术。类之间的高度相似性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号