首页> 外文期刊>Pattern recognition letters >How to utilize syllable distribution patterns as the input of LSTM for Korean morphological analysis
【24h】

How to utilize syllable distribution patterns as the input of LSTM for Korean morphological analysis

机译:如何利用音节分布模式作为LSTM的朝鲜语形态分析输入

获取原文
获取原文并翻译 | 示例
       

摘要

This paper proposes the use of syllable distribution patterns as deep learning inputs for morphological analysis. The proposed syllable distribution pattern comprises two parts: a distributed syllable embedding vector and a morpheme syllable-level distribution pattern. As a learning method, we utilize bidirectional long short-term memory with a conditional random field layer (Bi-LSTM-CRF) for Korean part-of-speech tagging tasks. After syllable-level outputs are generated by Bi-LSTM-CRF, a morpheme restoration process is performed utilizing pre-analyzed dictionaries that were automatically created from a training corpus. Experimental results reveal outstanding performance for the proposed method with an F1-score of 98.65%. (C) 2018 Elsevier B.V. All rights reserved.
机译:本文提出使用音节分布模式作为形态学分析的深度学习输入。提出的音节分布模式包括两部分:一个分布式音节嵌入向量和一个词素音节级分布模式。作为一种学习方法,我们将双向长短期记忆与条件随机字段层(Bi-LSTM-CRF)结合在一起,用于朝鲜语词性标记任务。通过Bi-LSTM-CRF生成音节级别的输出后,将使用从训练语料库自动创建的预先分析的词典执行语素恢复过程。实验结果表明,该方法的F1分数为98.65%,具有出色的性能。 (C)2018 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号