首页> 外文会议>International Conference on Agents and Artificial Intelligence >Lexical and Morpho-syntactic Features in Word Embeddings - A Case Study of Nouns in Swedish
【24h】

Lexical and Morpho-syntactic Features in Word Embeddings - A Case Study of Nouns in Swedish

机译:词汇嵌入词的词汇和杂语特征 - 以瑞典语为例

获取原文

摘要

We apply real-valued word vectors combined with two different types of classifiers (linear discriminant analysis and feed-forward neural network) to scrutinize whether basic nominal categories can be captured by simple word embedding models. We also provide a linguistic analysis of the errors generated by the classifiers. The targeted language is Swedish, in which we investigate three nominal aspects: uter/neuter, common/proper, and count/mass. They represent respectively grammatical, semantic, and mixed types of nominal classification within languages. Our results show that word embeddings can capture typical grammatical and semantic features such as uter/neuter and common/proper nouns. Nevertheless, the model encounters difficulties to identify classes such as count/mass which not only combine both grammatical and semantic properties, but are also subject to conversion and shift. Hence, we answer the call of the Special Session on Natural Language Processing in Artificial Intelligence by approaching the topic of interfaces between morphology, lexicon, semantics, and syntax via interdisciplinary methods combining machine learning of language and general linguistics.
机译:我们应用真实值的字矢量与两种不同类型的分类器(线性判别分析和前锋神经网络)相结合,以仔细审查基本标称类别是否可以通过简单的单词嵌入模型捕获。我们还提供了对分类器产生的错误的语言分析。目标语言是瑞典语,我们调查了三个标称方面:UTER / NERICER,COMPER / PATCH和COUNT / MASE。它们分别表示语言中的语法,语义和混合类型的名义分类。我们的结果表明,Word Embeddings可以捕获典型的语法和语义特征,例如UTER /中性和共同/适当的名词。然而,模型遇到难以识别诸如计数/质量等类别的困难,这不仅结合了语法和语义属性,而且还可以进行转换和换档。因此,我们通过跨学科方法接近机械学习的语言和通用语言学的机器学习,通过接近形态学,词典,语义和语法之间的界面主题来回答人工智能的自然语言处理的呼唤。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号