首页> 外文会议>IEEE Global Conference on Consumer Electronics >Syllable-Level Long Short-Term Memory Recurrent Neural Network-based Language Model for Korean Voice Interface in Intelligent Personal Assistants
【24h】

Syllable-Level Long Short-Term Memory Recurrent Neural Network-based Language Model for Korean Voice Interface in Intelligent Personal Assistants

机译:音节级智能个人助理中韩国语音接口的基于神经网络的基于神经网络的语言模型

获取原文

摘要

This study proposes a syllable-level long short-term memory (LSTM) recurrent neural network (RNN)-based language model for a Korean voice interface in intelligent personal assistants (IPAs). Most Korean voice interfaces in IPAs use word-level $n$ -gram language models. Such models suffer from the following two problems: 1) the syntax information in a longer word history is limited because of the limitation of $n$ and 2) The out-of-vocabulary (OOV) problem can occur in a word-based vocabulary. To solve the first problem, the proposed model uses an LSTM RNN-based language model because an LSTM RNN provides long-term dependency information. To solve the second problem, the proposed model is trained with a syllable-level text corpus. Korean words comprise syllables, and therefore, OOV words are not presented in a syllable-based lexicon. In experiments, the RNN-based language model and the proposed model achieved perplexity (PPL) of 68.74 and 17.81, respectively.
机译:本研究提出了一个音节级长的短期内存(LSTM)复发性神经网络(RNN)为智能个人助理(IPAS)中的韩国语音接口的语言模型。 IPA中的大多数韩国语音接口都使用字级 $ n $ -gram语言模型。此类模型遭受以下两个问题:1)由于限制,更长的单词历史中的语法信息是有限的 $ n $ 2)在基于词的词汇表中可能发生词汇流(OOV)问题。为了解决第一问题,所提出的模型使用基于LSTM RNN的语言模型,因为LSTM RNN提供了长期依​​赖信息。为了解决第二个问题,所提出的模型用音节级文本语料库训练。韩语单词包含音节,因此,基于音节的词典中没有呈现OOV字。在实验中,基于RNN的语言模型和所提出的模型分别实现了68.74和17.81的困惑(PPL)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号