Small in Size, Big in Precision: A Case for Using Language-Specific Lexical Resources for Word Sense Disambiguation

机译：大小小，精确度大：一种使用语言特定的词汇资源的案例，用于字感消歧

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Linked open data (LOD) presents an ideal platform for connecting the multilingual lexical resources used in natural language processing (NLP) tasks, but the use of machine translation to fill in gaps in lexical coverage for resource-poor languages means that large amounts of data are potentially unverified. For graph-based word sense disambiguation (WSD), one approach has been to first translate terms into English in order to disambiguate using richer, fuller lexical knowledge bases (LKBs) such as WordNet. In this paper, we show that this approach actually creates more ambiguity and is far less accurate than using language-specific resources, which, regardless of their smaller size, can provide results comparable in accuracy to the state-of-the-art reported for graph-based WSD in English. For LOD, this demonstrates the importance of continuing to grow and extend language-specific resources in order to continually verify and reintegrate them as accurate resources.

机译：链接的开放数据（LOD）提供了一个理想的平台，用于连接自然语言处理（NLP）任务中使用的多语种词汇资源，但使用机器翻译以填补资源差的语言的词汇覆盖范围意味着大量数据可能是未经证实的。对于基于图形的单词感歧义（WSD），一种方法将首先将术语翻译成英文，以消除使用更丰富，更富勒词的知识库（LKB），例如Wordnet。在本文中，我们表明，这种方法实际上创造了更加模糊性，并且比使用特定语言的资源更准确，这是无论其较小的尺寸如何，可以为最先进的最先进的准确性提供相当的结果。基于图形的WSD英文。对于LOD，这证明了继续增长和扩展特定语言资源的重要性，以便不断验证和重新融入它们作为准确的资源。

著录项

来源
《International conference on recent advances in natural language processing》|2015年||共10页
会议地点
作者
Steven Neale; Joao Silva; Antonio Branco;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Ant colony algorithm for Arabic word sense disambiguation through English lexical information [J] . Abdelaali Bakhouche, Tlili Yamina, Didier Schwab, International journal of metadata, semantics and ontologies . 2015,第3期

机译：通过英语词汇信息消除阿拉伯语词义歧义的蚁群算法
2. An accurate word sense disambiguation system based on weighted lexical features [J] . Abdoreza Rezapour, Seyed Mostafa Fakhrahmad, Mohammad Hadi Sadreddini, Literary & linguistic computing . 2014,第1期

机译：基于加权词法特征的准确词义消歧系统
3. SenseDefs: a multilingual corpus of semantically annotated textual definitions Exploiting multiple languages and resources jointly for high-quality Word Sense Disambiguation and Entity Linking [J] . Camacho-Collados Jose, Bovi Claudio Delli, Raganato Alessandro, Language Resources and Evaluation . 2019,第2期

机译：SenseDefs：带有语义注释的文本定义的多语言语料库共同开发多种语言和资源，以实现高质量的词义消歧和实体链接
4. Small in Size, Big in Precision: A Case for Using Language-Specific Lexical Resources for Word Sense Disambiguation [C] . Steven Neale, Joao Silva, Antonio Branco 2nd Workshop on natural language processing and linked open data 2015 . 2015

机译：体积小，精度大：使用特定语言的词汇资源进行词义消歧的案例
5. Investigations into the role of lexical semantics in word sense disambiguation. [D] . Dang, Hoa Trang. 2004

机译：调查词义歧义中词汇语义的作用。
6. Word sense disambiguation for event trigger word detection in biomedicine [O] . David Martinez, Timothy Baldwin 2011

机译：用于生物医学中事件触发词检测的词义消歧
7. Forming an Integrated Lexical Resource for Word Sense Disambiguation [O] . Kwong Oi Yee 2001

机译：形成用于词义消歧的综合词汇资源
8. Word Domain Disambiguation via Word Sense Disambiguation [R] . Sanfilippo, A. 2006

机译：Word Word消歧通过Word sense消歧

Small in Size, Big in Precision: A Case for Using Language-Specific Lexical Resources for Word Sense Disambiguation

摘要

著录项

相似文献

相关主题

期刊订阅