首页> 外文会议>Annual International Conference on Computational Linguistics and Intelligent Text Processing >Using Embedding Models for Lexical Categorization in Morphologically Rich Languages
【24h】

Using Embedding Models for Lexical Categorization in Morphologically Rich Languages

机译:在形态上丰富语言中使用嵌入模型进行词法分类

获取原文

摘要

Neural-network-based semantic embedding models are relatively new but popular tools in the field of natural language processing. It has been shown that continuous embedding vectors assigned to words provide an adequate representation of their meaning in the case of English. However, morphologically rich languages have not yet been the subject of experiments with these embedding models. In this paper, we investigate the performance of embedding models for Hungarian, trained on corpora with different levels of preprocessing. The models are evaluated on various lexical categorization tasks. They are used for enriching the lexical database of a morphological analyzer with semantic features automatically extracted from the corpora.
机译:基于神经网络的语义嵌入模型是在自然语言处理领域的相对较新的但流行的工具。已经表明,在英语的情况下,分配给单词的连续嵌入向量提供了其含义的充分表示。然而,形态学丰富的语言尚未成为这些嵌入模型的实验的主题。在本文中,我们调查匈牙利嵌入式模型的性能,在具有不同层次的预处理培训。这些模型在各种词法分类任务上进行评估。它们用于丰富与语料库中自动提取的语义功能的形态分析仪的词汇数据库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号