Word Independent Context Pair Classification Model for Word Sense Disambiguation


Abstract

Traditionally, word sense disambiguation (WSD) involves a different context classification model for each individual word. This paper presents a weakly supervised learning approach to WSD based on learning a word independent context pair classification model. Statistical models are not trained for classifying the word contexts, but for classifying a pair of contexts, i.e. determining if a pair of contexts of the same ambiguous word refers to the same or different senses. Using this approach, annotated corpus of a target word A can be explored to disambiguate senses of a different word B. Hence, only a limited amount of existing annotated corpus is required in order to disambiguate the entire vocabulary. In this research, maximum entropy modeling is used to train the word independent context pair classification model. Then based on the context pair classification results, clustering is performed on word mentions extracted from a large raw corpus. The resulting context clusters are mapped onto the external thesaurus WordNet. This approach shows great flexibility to efficiently integrate heterogeneous knowledge sources, e.g. trigger words and parsing structures. Based on Senseval-3 Lexical Sample standards, this approach achieves state-of-the-art performance in the unsupervised learning category, and performs comparably with the supervised Naïve Bayes system.
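
The pipeline described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. It assumes a single hypothetical pair feature (Jaccard overlap of context words) in place of the trigger words and parsing structures the abstract mentions, fits a binary maximum entropy model (equivalent to logistic regression) by plain gradient ascent, and swaps in simple transitive linking (union-find) for the clustering step. The toy contexts, function names, and the 0.5 threshold are all made up for illustration, and the WordNet mapping step is omitted.

```python
import math
from itertools import combinations

def pair_features(ctx_a, ctx_b):
    """Word-independent features of a context pair: [bias, Jaccard overlap].
    The overlap feature is an assumption standing in for richer features."""
    a, b = set(ctx_a.split()), set(ctx_b.split())
    union = a | b
    overlap = len(a & b) / len(union) if union else 0.0
    return [1.0, overlap]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def make_training_pairs(annotated):
    """Turn sense-annotated contexts of one word into labeled context pairs."""
    X, y = [], []
    for (c1, s1), (c2, s2) in combinations(annotated, 2):
        X.append(pair_features(c1, c2))
        y.append(1 if s1 == s2 else 0)  # 1 = same sense, 0 = different sense
    return X, y

def train_maxent(X, y, epochs=2000, lr=0.5):
    """Binary maximum entropy model fit by batch gradient ascent."""
    w = [0.0] * len(X[0])
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for x, label in zip(X, y):
            p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))
            for i, xi in enumerate(x):
                grad[i] += (label - p) * xi
        w = [wi + lr * g / len(X) for wi, g in zip(w, grad)]
    return w

def same_sense_prob(w, ctx_a, ctx_b):
    x = pair_features(ctx_a, ctx_b)
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)))

def cluster(contexts, w, threshold=0.5):
    """Link every pair predicted 'same sense' and return connected groups."""
    parent = list(range(len(contexts)))
    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i
    for i, j in combinations(range(len(contexts)), 2):
        if same_sense_prob(w, contexts[i], contexts[j]) > threshold:
            parent[find(i)] = find(j)
    groups = {}
    for i in range(len(contexts)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())

# Annotated contexts of word A ("bank"), target word removed (toy data).
bank_annotated = [
    ("deposit money account loan", "finance"),
    ("loan interest account money", "finance"),
    ("river water shore fishing", "river"),
    ("muddy shore river water", "river"),
]
# Unannotated contexts of a different word B ("bass") (toy data).
bass_contexts = [
    "guitar band play music",
    "music band drum play",
    "fish lake caught water",
    "caught lake fish boat",
]

X, y = make_training_pairs(bank_annotated)
weights = train_maxent(X, y)
clusters = cluster(bass_contexts, weights)
print(clusters)
```

Because the pair features never refer to the target word itself, the model trained on pairs of "bank" contexts can be applied unchanged to "bass": on this toy data the two music contexts end up in one cluster and the two fishing contexts in another.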


Bibliographic details

Source: 43rd Annual Meeting of the Association for Computational Linguistics: Proceedings of the Conference, 2005, pp. 33-39 (7 pages)
Authors: Cheng Niu; Wei Li; Rohini K. Srihari; Huifeng Li
Original format: PDF

Similar literature


1. Adaptive and hybrid context-aware fine-grained word sense disambiguation in topic modeling based document representation [J]. Wenbo Li, Einoshin Suzuki. Information Processing & Management, 2021, no. 4.
2. Improved convolutional neural network for biomedical word sense disambiguation with enhanced context feature modeling [J]. REN Kai, WANG Shi-Wen. Journal of Digital Information Management, 2016, no. 6.
3. Learning model order from labeled and unlabeled data for partially supervised classification, with application to word sense disambiguation [J]. Zheng-Yu Niu, Dong-Hong Ji, Chew Lim Tan. Computer Speech and Language, 2007, no. 4.
4. Context Clustering for Word Sense Disambiguation Based on Modeling Pairwise Context Similarities [C]. Cheng Niu, Wei Li, Rohini K. Srihari. 42nd Annual Meeting of the Association for Computational Linguistics, 2004.
5. Improving Intent Classification By Automatic Data Augmentation Using Word Sense Disambiguation [D]. Garg, Prashant. 2018.
6. Clinical Word Sense Disambiguation with Interactive Search and Classification [O]. Yue Wang, Kai Zheng, Hua Xu. 2016.
7. Construction of Context Models for Word Sense Disambiguation [O]. Bernard Brosseau-Villeneuve, Noriko Kando, Jian-Yun Nie. 2011.
8. Word Domain Disambiguation via Word Sense Disambiguation [R]. Sanfilippo, A. 2006.
