Using Embedding Models for Lexical Categorization in Morphologically Rich Languages

机译：在形态上丰富语言中使用嵌入模型进行词法分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Neural-network-based semantic embedding models are relatively new but popular tools in the field of natural language processing. It has been shown that continuous embedding vectors assigned to words provide an adequate representation of their meaning in the case of English. However, morphologically rich languages have not yet been the subject of experiments with these embedding models. In this paper, we investigate the performance of embedding models for Hungarian, trained on corpora with different levels of preprocessing. The models are evaluated on various lexical categorization tasks. They are used for enriching the lexical database of a morphological analyzer with semantic features automatically extracted from the corpora.

机译：基于神经网络的语义嵌入模型是在自然语言处理领域的相对较新的但流行的工具。已经表明，在英语的情况下，分配给单词的连续嵌入向量提供了其含义的充分表示。然而，形态学丰富的语言尚未成为这些嵌入模型的实验的主题。在本文中，我们调查匈牙利嵌入式模型的性能，在具有不同层次的预处理培训。这些模型在各种词法分类任务上进行评估。它们用于丰富与语料库中自动提取的语义功能的形态分析仪的词汇数据库。

著录项

来源
《Annual International Conference on Computational Linguistics and Intelligent Text Processing》|2018年|678p|共12页
会议地点
作者
Borbala Siklosi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP312-53;
关键词

相似文献

外文文献
中文文献
专利

1. Joint Morphological-Lexical Language Modeling for Processing Morphologically Rich Languages With Application to Dialectal Arabic [J] . Sarikaya R., Afify M., Deng Y., IEEE transactions on audio, speech and language processing . 2008,第7期

机译：形态-词汇联合语言建模，用于处理形态丰富的语言及其在方言阿拉伯语中的应用
2. Native-likeness in second language lexical categorization reflects individual language history and linguistic community norms [J] . Benjamin D. Zinszer, Barbara C. Malt, Eef Ameel, Frontiers in Psychology . 2014,第4期

机译：第二语言词汇分类中的本地相似性反映了个别语言的历史和语言社区规范
3. From Lexical Tone to Lexical Stress: A Cross-Language Mediation Model for Cantonese Children Learning English as a Second Language [J] . William Choi, Xiuli Tong, Leher Singh Frontiers in Psychology . 2017,第1期

机译：从词汇语气到词汇重音：针对以英语为第二语言学习的广东儿童的跨语言中介模型
4. Using Embedding Models for Lexical Categorization in Morphologically Rich Languages [C] . Borbala Sikl6si International conference on intelligent text processing and computational linguistics . 2018

机译：使用嵌入模型在形态丰富的语言中进行词汇分类
5. The effect of exposure to phonological variation on perceptual categorization and lexical access in second language Spanish: The case of /s/-aspiration in Western Andalusian Spanish. [D] . Bedinghaus, Robert William. 2015

机译：暴露于语音变化对第二语言西班牙语的感知分类和词汇访问的影响：以西安达卢西亚西班牙语中的/ s / -aspiration为例。
6. Native-likeness in second language lexical categorization reflects individual language history and linguistic community norms [O] . Benjamin D. Zinszer, Barbara C. Malt, Eef Ameel, 2014

机译：第二语言词汇分类中的母语相似性反映了个别语言的历史和语言社区规范
7. Joint morphological-lexical language modeling for processing morphologically rich languages with application to dialectal Arabic [O] . Sarıkaya Ruhi, Sarikaya Ruhi, Afify Mohamed, 2008

机译：形态学-词汇语言联合模型，用于处理形态丰富的语言，并应用于方言阿拉伯语

Using Embedding Models for Lexical Categorization in Morphologically Rich Languages

摘要

著录项

相似文献

相关主题

期刊订阅