首页> 外文会议>International conference on intelligent text processing and computational linguistics >Using Embedding Models for Lexical Categorization in Morphologically Rich Languages
【24h】

Using Embedding Models for Lexical Categorization in Morphologically Rich Languages

机译:使用嵌入模型在形态丰富的语言中进行词汇分类

获取原文

摘要

Neural-network-based semantic embedding models are relatively new but popular tools in the field of natural language processing. It has been shown that continuous embedding vectors assigned to words provide an adequate representation of their meaning in the case of English. However, morphologically rich languages have not yet been the subject of experiments with these embedding models. In this paper, we investigate the performance of embedding models for Hungarian, trained on corpora with different levels of preprocessing. The models are evaluated on various lexical categorization tasks. They are used for enriching the lexical database of a morphological analyzer with semantic features automatically extracted from the corpora.
机译:基于神经网络的语义嵌入模型是自然语言处理领域中相对较新但很流行的工具。已经表明,在英语的情况下,分配给单词的连续嵌入向量可以充分表达其含义。但是,形态丰富的语言尚未成为使用这些嵌入模型进行实验的对象。在本文中,我们研究了匈牙利语嵌入模型的性能,这些模型在具有不同预处理级别的语料库上进行了训练。在各种词汇分类任务上对模型进行评估。它们用于通过从语料库中自动提取的语义特征来丰富形态分析器的词法数据库。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号