Using Embedding Models for Lexical Categorization in Morphologically Rich Languages

机译：使用嵌入模型在形态丰富的语言中进行词汇分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Neural-network-based semantic embedding models are relatively new but popular tools in the field of natural language processing. It has been shown that continuous embedding vectors assigned to words provide an adequate representation of their meaning in the case of English. However, morphologically rich languages have not yet been the subject of experiments with these embedding models. In this paper, we investigate the performance of embedding models for Hungarian, trained on corpora with different levels of preprocessing. The models are evaluated on various lexical categorization tasks. They are used for enriching the lexical database of a morphological analyzer with semantic features automatically extracted from the corpora.

机译：基于神经网络的语义嵌入模型是自然语言处理领域中相对较新但很流行的工具。已经表明，在英语的情况下，分配给单词的连续嵌入向量可以充分表达其含义。但是，形态丰富的语言尚未成为使用这些嵌入模型进行实验的对象。在本文中，我们研究了匈牙利语嵌入模型的性能，这些模型在具有不同预处理级别的语料库上进行了训练。在各种词汇分类任务上对模型进行评估。它们用于通过从语料库中自动提取的语义特征来丰富形态分析器的词法数据库。

著录项

来源
《International conference on intelligent text processing and computational linguistics》|2018年|115-126|共12页
会议地点
作者
Borbala Sikl6si;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Joint Morphological-Lexical Language Modeling for Processing Morphologically Rich Languages With Application to Dialectal Arabic [J] . Sarikaya R., Afify M., Deng Y., IEEE transactions on audio, speech and language processing . 2008,第7期

机译：形态-词汇联合语言建模，用于处理形态丰富的语言及其在方言阿拉伯语中的应用
2. Native-likeness in second language lexical categorization reflects individual language history and linguistic community norms [J] . Benjamin D. Zinszer, Barbara C. Malt, Eef Ameel, Frontiers in Psychology . 2014,第4期

机译：第二语言词汇分类中的本地相似性反映了个别语言的历史和语言社区规范
3. From Lexical Tone to Lexical Stress: A Cross-Language Mediation Model for Cantonese Children Learning English as a Second Language [J] . William Choi, Xiuli Tong, Leher Singh Frontiers in Psychology . 2017,第1期

机译：从词汇语气到词汇重音：针对以英语为第二语言学习的广东儿童的跨语言中介模型
4. Using Embedding Models for Lexical Categorization in Morphologically Rich Languages [C] . Borbala Siklosi Annual International Conference on Computational Linguistics and Intelligent Text Processing . 2018

机译：在形态上丰富语言中使用嵌入模型进行词法分类
5. The effect of exposure to phonological variation on perceptual categorization and lexical access in second language Spanish: The case of /s/-aspiration in Western Andalusian Spanish. [D] . Bedinghaus, Robert William. 2015

机译：暴露于语音变化对第二语言西班牙语的感知分类和词汇访问的影响：以西安达卢西亚西班牙语中的/ s / -aspiration为例。
6. Native-likeness in second language lexical categorization reflects individual language history and linguistic community norms [O] . Benjamin D. Zinszer, Barbara C. Malt, Eef Ameel, 2014

机译：第二语言词汇分类中的母语相似性反映了个别语言的历史和语言社区规范
7. Joint morphological-lexical language modeling for processing morphologically rich languages with application to dialectal Arabic [O] . Sarıkaya Ruhi, Sarikaya Ruhi, Afify Mohamed, 2008

机译：形态学-词汇语言联合模型，用于处理形态丰富的语言，并应用于方言阿拉伯语

Using Embedding Models for Lexical Categorization in Morphologically Rich Languages

摘要

著录项

相似文献

相关主题

期刊订阅