首页> 外文会议>International conference on computational linguistics >Towards Semantic Validation of a Derivational Lexicon
【24h】

Towards Semantic Validation of a Derivational Lexicon

机译:朝着衍生词典的语义验证

获取原文

摘要

Derivationally related lemmas like friend_N -friendly_A - friendship_N are derived from a common stem. Frequently, their meanings are also systematically related. However, there are also many examples of derivationally related lemma pairs whose meanings differ substantially, e.g., object_N - objective_N. Most broad-coverage derivational lexicons do not reflect this distinction, mixing up semantically related and unrelated word pairs. In this paper, we investigate strategies to recover the above distinction by recognizing semantically related lemma pairs, a process we call semantic validation. We make two main contributions: First, we perform a detailed data analysis on the basis of a large German derivational lexicon. It reveals two promising sources of information (distributional semantics and structural information about derivational rules), but also systematic problems with these sources. Second, we develop a classification model for the task that reflects the noisy nature of the data. It achieves an improvement of 13.6% in precision and 5.8% in F1-score over a strong majority class baseline. Our experiments confirm that both information sources contribute to semantic validation, and that they are complementary enough that the best results are obtained from a combined model.
机译:衍生相关的相关lemmas,如friend_n-friendly_a - friendship_n源自公共词干。通常,他们的含义也有系统地相关。然而,还存在许多衍生相关的LEMMA对的示例,其含义基本上不同,例如object_n - 目标_n。最广泛的覆盖率衍生词典不反映这种区别,混合语义相关和无关的词对。在本文中,我们调查通过识别语义相关的lemma对来恢复上述区别的策略,这是我们称之为语义验证的过程。我们提出了两个主要贡献:首先,我们根据大型德国衍生词典进行详细的数据分析。它揭示了两个有前途的信息来源(分布语义和有关衍生规则的结构信息),但也有这些来源的系统问题。其次,我们为反映数据嘈杂性质的任务开发了一个分类模型。它以强大的大多数类基线实现了13.6%的精度和5.8%的5.8%。我们的实验证实,两种信息来源都有助于语义验证,并且它们是足够互补的,以至于从组合模型获得最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号