Word Sense Disambiguation Using Inductive Logic Programming

机译：使用归纳逻辑编程的词义消歧

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The identification of the correct sense of a word is necessary for many tasks in automatic natural language processing like machine translation, information retrieval, speech and text processing. Automatic Word Sense Disambiguation (WSD) is difficult and accuracies with state-of-the art methods are substantially lower than in other areas of text understanding like part-of-speech tagging. One shortcoming of these methods is that they do not utilize substantial sources of background knowledge, such as semantic taxonomies and dictionaries, which are now available in electronic form (the methods largely use shallow syntactic features). Empirical results from the use of Inductive Logic Programming (ILP) have repeatedly shown the ability of ILP systems to use diverse sources of background knowledge. In this paper we investigate the use of ILP for WSD in two different ways: (a) as a stand-alone constructor of models for WSD; and (b) to build interesting features, which can then be used by standard model-builders such as SVM. In our experiments we examine a monolingual WSD task using the 32 English verbs contained in the SENSEVAL-3 benchmark data; and a bilingual WSD task using 7 highly ambiguous verbs in machine translation from English to Portuguese. Background knowledge available is from eight sources that provide a wide range of syntactic and semantic information. For both WSD tasks, experimental results show that ILP-constructed models and models built using ILP-generated features have higher accuracies than those obtained using a state-of-the art feature-based technique equipped with shallow syntactic features. This suggests that the use of ILP with diverse sources of background knowledge can provide one way for making substantial progress in the field of automatic WSD.

机译：对于自动自然语言处理中的许多任务（例如机器翻译，信息检索，语音和文本处理），识别正确的词义是必不可少的。自动词义消除歧义（WSD）很难，并且最新技术的准确性要远低于诸如词性标记之类的其他文本理解领域。这些方法的一个缺点是它们没有利用大量的背景知识资源，例如语义分类法和字典，这些资源现在可以以电子形式获得（这些方法主要使用浅层语法特征）。使用归纳逻辑编程（ILP）得出的经验结果反复表明，ILP系统具有使用各种背景知识资源的能力。在本文中，我们以两种不同的方式研究了ILP在WSD中的使用：（a）作为WSD模型的独立构造函数；（b）构建有趣的功能，然后可以由标准模型构建器（例如SVM）使用。在我们的实验中，我们使用SENSEVAL-3基准数据中包含的32个英语动词检查了单语WSD任务；和双语的WSD任务，在从英语到葡萄牙语的机器翻译中使用7个高度歧义的动词。现有的背景知识来自八个来源，可提供广泛的句法和语义信息。对于这两个WSD任务，实验结果表明，使用ILP构造的模型和使用ILP生成的特征构建的模型的准确性要高于使用具有浅句法特征的基于最新特征的技术所获得的准确性。这表明将ILP与各种背景知识一起使用可以为在自动WSD领域取得实质性进展提供一种方法。

著录项

来源
《International Conference on Inductive Logic Programming(ILP 2006); 20060824-27; Santiago de Compostela(ES)》|2006年|P.409-423|共15页
会议地点 Santiago de Compostela(ES)
作者
Lucia Specia; Ashwin Srinivasan; Ganesh Ramakrishnan; Maria das Gracas Volpe Nunes;
展开▼
作者单位

ICMC - University of Sao Paulo, Trabalhador Sao-Carlense, 400, Sao Carlos, 13560-970, Brazil;

IBM India Research Laboratory, Block 1, Indian Institute of Technology, New Delhi 110016, India;

Dept. of Computer Science and Engineering Centre for Health Inf;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Fuzzy Logic for Inculcating Significance of Semantic Relations in Word Sense Disambiguation Using a WordNet Graph [J] . Sonakshi Vij, Amita Jain, Devendra Tayal, International Journal of Fuzzy Systems . 2018,第2期

机译：使用WordNet图谱在语义歧义化中灌输语义关系的意义的模糊逻辑
2. A Sense Annotated Corpus for All-Words Urdu Word Sense Disambiguation [J] . Saeed Ali, Nawab Rao Muhammad Adeel, Stevenson Mark, ACM transactions on Asian language information processing . 2019,第4期

机译：用于全词乌尔都语的词义注释语料库
3. Support vector inductive logic programming outperforms the naive Bayes classifier and inductive logic programming for the classification of bioactive chemical compounds [J] . Edward O. Cannon, Ata Amini, Andreas Bender, Journal of Computer-Aided Molecular Design . 2007,第5期

机译：支持向量归纳逻辑编程的性能优于朴素贝叶斯分类器和归纳逻辑编程，可用于生物活性化合物的分类
4. Word Sense Disambiguation Using Inductive Logic Programming [C] . Lucia Specia, Ashwin Srinivasan, Ganesh Ramakrishnan, International Conference on Inductive Logic Programming . 2007

机译：使用归纳逻辑编程的词感歧义
5. Subjectivity word sense disambiguation: A method for sense-aware subjectivity analysis. [D] . Akkaya, Cem. 2014

机译：主观性词义消歧：一种用于感知感知的主观性分析的方法。
6. Word sense disambiguation for event trigger word detection in biomedicine [O] . David Martinez, Timothy Baldwin 2011

机译：用于生物医学中事件触发词检测的词义消歧
7. Word Sense Disambiguation using Inductive Logic Programming [O] . Lucia Specia, Ashwin Srinivasan, Ganesh Ramakrishnan, 2007

机译：使用归纳逻辑编程的词义消歧
8. Word Domain Disambiguation via Word Sense Disambiguation [R] . Sanfilippo, A. 2006

机译：Word Word消歧通过Word sense消歧

Word Sense Disambiguation Using Inductive Logic Programming

摘要

著录项

相似文献

相关主题

期刊订阅