首页> 外文会议>International Symposium on Knowledge and Systems Sciences(KSS2004); 20041110-12; Ishikawa(JP) >Improving Word Sense Disambiguation Accuracy Using Naive Bayesian Classifier with Rich Features
【24h】

Improving Word Sense Disambiguation Accuracy Using Naive Bayesian Classifier with Rich Features

机译:使用具有丰富功能的朴素贝叶斯分类器提高词义消歧准确度

获取原文
获取原文并翻译 | 示例

摘要

Word Sense Disambiguation (WSD) is the task of choosing the right sense of an ambiguous word given a context. Resolving the ambiguity of words is one of the most important open problems in Natural Language Processing (NLP) field. It is applied to many NLP tasks such as machine translation, information retrieval, parsing, and text summarization. The Naive Bayesian (NB) classifier is known as one of the best methods for supervised approaches such as in [7, 9], and this model usually uses only a topic context represented by unordered words in a large context. In this paper, we show that by adding more rich knowledge, represented by ordered words in a local context and collocations, WSD using a NB classifier can achieve higher accuracy in comparison with the best previously published results. The features were chosen using a forward sequential selection algorithm. Our experiments obtained 92.3% accuracy for four common test words (interest, line, hard, serve). We also tested on a large dataset, the DSO corpus, and obtained the accuracies of 66.2% for verbs and 73.2% for nouns.
机译:词义消歧(WSD)是在给定上下文的情况下为歧义词选择正确含义的任务。解决单词的歧义是自然语言处理(NLP)领域中最重要的开放问题之一。它适用于许多NLP任务,例如机器翻译,信息检索,解析和文本摘要。朴素贝叶斯(NB)分类器被称为监督方法的最佳方法之一,例如在[7,9]中,并且该模型通常仅在大型上下文中仅使用由无序单词表示的主题上下文。在本文中,我们表明,通过添加更多丰富的知识(由本地上下文中的有序单词表示和并置表示),使用NB分类器的WSD与以前发布的最佳结果相比可以实现更高的准确性。使用正向顺序选择算法选择特征。我们的实验针对四个常见的测试字词(兴趣,趣味,困难,发球)获得了92.3%的准确性。我们还在大型数据集DSO语料库上进行了测试,得出动词的准确度为66.2%,名词的准确度为73.2%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号