首页> 外文会议>Computational Linguistics and Intelligent Text Processing >Feature Selection for Chinese Character Sense Discrimination
【24h】

Feature Selection for Chinese Character Sense Discrimination

机译:汉字义辨的特征选择

获取原文

摘要

Word sense discrimination is to group occurrences of a word into clusters based on unsupervised classification method, where each cluster consists of occurrences having same meaning. Feature extraction method has been used to reduce the dimension of context vector in English word sense discrimination task. But if original dimension has a real meaning to users and relevant features exist in original dimensions, feature selection is a better choice for finding relevant features. In this paper we apply two unsupervised feature selection schemes to Chinese character sense discrimination, which are entropy based feature filter and Minimum Description Length based feature wrapper. Using precision evaluation and known ground-truth classification result, our preliminary experiment results demonstrate that feature selection method performs better than feature extraction method on Chinese character sense discrimination task.
机译:词义辨别是基于无监督分类方法将单词的出现分组为类,其中每个类由具有相同含义的出现组成。特征提取方法已被用于减少英语单词义辨别任务中上下文向量的维数。但是,如果原始尺寸对用户具有真正的意义,并且相关特征存在于原始尺寸中,则特征选择是查找相关特征的更好选择。在本文中,我们将两种无监督的特征选择方案应用于汉字感官判别,这两种方法是基于熵的特征过滤器和基于最小描述长度的特征包装器。利用精确度评估和已知的地面真伪分类结果,我们的初步实验结果表明,特征选择方法在特征识别方法上的性能优于特征提取方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号