Journal: Language Resources and Evaluation

PQAC-WN: constructing a wordnet for Pre-Qin ancient Chinese

Abstract

The Princeton WordNet® (PWN) is a widely used lexical knowledge database for semantic information processing, and wordnets are now being built for many languages worldwide. In this paper, we construct a wordnet for Pre-Qin ancient Chinese (PQAC), called PQAC WordNet (PQAC-WN), to process the semantic information of PQAC. Most recently constructed wordnets have been built either manually by experts or automatically from resources that yield translation pairs between English and the target language. The manual method is time-consuming, and the automatic method cannot be applied to PQAC because such language resources are lacking. We therefore propose a method based on word definitions in a monolingual dictionary. Specifically, for each sense, kernel words are first extracted from its definition, and the senses of each kernel word are then determined by graph-based Word Sense Disambiguation. Finally, one optimal sense is chosen from the kernel word senses to guide the mapping between the word sense and a PWN synset. We find that 66% of PQAC senses can be shared with English, and another 14% are language-specific senses that were added to PQAC-WN as new synsets. Overall, the automatic mapping achieves a precision of over 85%.
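The disambiguation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the sense graph is built or scored, so this version assumes gloss-overlap edge weights and weighted-degree scoring. The toy sense inventory, the sense labels such as `bank#1`, and the function names `disambiguate` and `guiding_sense` are all hypothetical, and English words stand in for PQAC dictionary entries.

```python
from itertools import combinations

# Hypothetical toy sense inventory (not from the paper):
# kernel word -> {sense label: set of gloss tokens}.
SENSES = {
    "bank": {
        "bank#1": {"sloping", "land", "beside", "river", "water"},
        "bank#2": {"financial", "institution", "money", "deposit"},
    },
    "stream": {
        "stream#1": {"natural", "body", "running", "water", "river"},
        "stream#2": {"steady", "flow", "data", "succession"},
    },
}


def disambiguate(kernel_words, senses=SENSES):
    """Pick one sense per kernel word by weighted degree in a sense graph."""
    # Nodes are (word, sense) pairs for every candidate sense.
    nodes = [(w, s) for w in kernel_words for s in senses[w]]
    score = {node: 0 for node in nodes}
    for (w1, s1), (w2, s2) in combinations(nodes, 2):
        if w1 == w2:
            continue  # senses of the same word compete; no edge between them
        # Assumed edge weight: number of gloss tokens the two senses share.
        weight = len(senses[w1][s1] & senses[w2][s2])
        score[(w1, s1)] += weight
        score[(w2, s2)] += weight
    # For each word, keep the candidate sense with the highest weighted degree.
    best = {w: max(senses[w], key=lambda s: score[(w, s)]) for w in kernel_words}
    return best, score


def guiding_sense(best, score):
    """Assumed selection rule: the chosen sense with the highest graph score."""
    word = max(best, key=lambda w: score[(w, best[w])])
    return word, best[word]


best, score = disambiguate(["bank", "stream"])
print(best)
print(guiding_sense(best, score))
```

In a real pipeline the glosses would come from the monolingual dictionary and from PWN, and the guiding sense would then be used to look up a candidate PWN synset; both parts are left out here because the abstract gives no details.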
