首页> 外文会议>7th International Conference on Natural Language Processing and Knowledge Engineering >A novel feature selection methodology based on outlier detection technologies
【24h】

A novel feature selection methodology based on outlier detection technologies

机译:基于异常检测技术的新颖特征选择方法

获取原文
获取原文并翻译 | 示例

摘要

Feature selection is becoming more and more important for natural language processing as well as knowledge engineering. In this paper, we induce a simple principle that if an attribute subset has more representativeness, then it should be more self-organized, as a result it should be more insensitive to artificially seeded noise points. Based on that, our novel methodology transforms feature selection problems into outlier detection problems. Because of the characteristics of outlier detection problems, our framework can achieve high tolerance of noises, sub-samplings, and even classification errors in training data sets, which are extraordinary features of our method. Moreover, to evaluate the performance of our method comprehensively, we compare our method with several state-of-the-art methods on a number of real-life data sets, and give all the experiment results, which show that our method can accomplish feature reduction tasks with really high accuracy as well as remarkably low computing complexity.
机译:对于自然语言处理以及知识工程而言,特征选择正变得越来越重要。在本文中,我们得出一个简单的原理,即如果属性子集具有更大的代表性,则它应该更加自组织,因此,它对人工种子的噪声点应该不敏感。基于此,我们的新颖方法将特征选择问题转换为离群值检测问题。由于异常检测问题的特征,我们的框架可以在训练数据集中实现对噪声,子采样甚至分类错误的高容忍度,这是我们方法的非凡特征。此外,为了全面评估我们的方法的性能,我们在大量真实数据集上将我们的方法与几种最新方法进行了比较,并给出了所有实验结果,表明我们的方法可以完成功能真正意义上的精简任务,以及极低的计算复杂度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号