首页> 外文期刊>电子科学学刊(英文版) >WORD SENSE DISAMBIGUATION BASED ON IMPROVED BAYESIAN CLASSIFIERS
【24h】

WORD SENSE DISAMBIGUATION BASED ON IMPROVED BAYESIAN CLASSIFIERS

机译:基于改进贝叶斯分类器的词义消歧

获取原文
获取原文并翻译 | 示例
       

摘要

Word Sense Disambiguation (WSD) is to decide the sense of an ambiguous word on particular context. Most of current studies on WSD only use several ambiguous words as test samples, thus leads to some limitation in practical application. In this paper, we perform WSD study based on large scale real-world corpus using two unsupervised learning algorithms based on ±n-improved Bayesian model and Dependency Grammar(DG)-improved Bayesian model. ±n-improved classifiers reduce the window size of context of ambiguous words with close-distance feature extraction method, and decrease the jamming of useless features, thus obviously improve the accuracy, reaching 83.18% (in open test). DG-improved classifier can more effectively conquer the noise effect existing in Naive-Bayesian classifier. Experimental results show that this approach does better on Chinese WSD, and the open test achieved an accuracy of 86.27%.
机译:词义歧义消除(WSD)用于确定特定上下文中歧义词的含义。当前对WSD的大多数研究仅使用几个歧义词作为测试样本,因此在实际应用中会受到一些限制。在本文中,我们使用基于±n改进的贝叶斯模型和依存语法(DG)改进的贝叶斯模型的两种无监督学习算法,基于大规模现实语料库进行WSD研究。 ±n改进的分类器采用近距离特征提取方法减小了模糊词上下文的窗口大小,并减少了无用特征的干扰,从而明显提高了准确性,达到了83.18%(在开放测试中)。 DG改进的分类器可以更有效地克服朴素贝叶斯分类器中存在的噪声效应。实验结果表明,该方法在中国水务署上效果更好,开放测试的准确率达到86.27%。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号