首页> 外文会议>43rd Annual Meeting of the Association for Computational Linguistics: Proceeding of the Conference >Word Independent Context Pair Classification Model for WordSense Disambiguation
【24h】

Word Independent Context Pair Classification Model for WordSense Disambiguation

机译:用于词义消歧的词无关上下文对分类模型

获取原文

摘要

Traditionally, word sense disambiguation(WSD) involves a different context classificationmodel for each individual word. Thispaper presents a weakly supervised learningapproach to WSD based on learning a wordindependent context pair classificationmodel. Statistical models are not trained forclassifying the word contexts, but for classifyinga pair of contexts, I.e. determining if apair of contexts of the same ambiguous wordrefers to the same or different senses. Usingthis approach, annotated corpus of a targetword A can be explored to disambiguatesenses of a different word B. Hence, only alimited amount of existing annotated corpusis required in order to disambiguate the entirevocabulary. In this research, maximum entropymodeling is used to train the word independentcontext pair classification model.Then based on the context pair classificationresults, clustering is performed on word mentionsextracted from a large raw corpus. Theresulting context clusters are mapped ontothe external thesaurus WordNet. This approachshows great flexibility to efficientlyintegrate heterogeneous knowledge sources,e.g. trigger words and parsing structures.Based on Senseval-3 Lexical Sample standards,this approach achieves state-of-the-artperformance in the unsupervised learningcategory, and performs comparably with thesupervised Na?ve Bayes system.
机译:传统上,单词义消歧 (WSD)涉及不同的上下文分类 每个单词的模型。这 论文提出了弱监督学习 学习单词的WSD方法 独立上下文对分类 模型。统计模型未经过训练 对单词上下文进行分类,但用于分类 一对背景确定是否 同一歧义词的一对上下文 指相同或不同的感觉。使用 这种方法,目标的注释语料库 可以探索单词A来消除歧义 感同一个单词B。因此,只有一个 现有注解语料库数量有限 为了消除整个歧义 词汇。在这项研究中,最大熵 建模用于训练独立词 上下文对分类模型。 然后根据上下文对分类 结果,对单词提及进行聚类 从大型原始语料库中提取。这 结果上下文集群被映射到 外部词库WordNet。这种方法 显示出极大的灵活性,可以高效地 整合异构知识源, 例如触发单词和解析结构。 根据Senseval-3词法样本标准, 这种方法达到了最先进的水平 无监督学习中的表现 类别,并且与 监督朴素贝叶斯系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号