首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Out-of-vocabulary word modeling by using sub-word units
【24h】

Out-of-vocabulary word modeling by using sub-word units

机译:使用子词单元进行词汇外词建模

获取原文
获取原文并翻译 | 示例
           

摘要

A structured language model (STLM) is proposed to cope with out-of-vocabulary (OOV) words coming from multiple word-classes. The STLM aims at independently modeling the classes without interference and identifying the class of words arising from multiple word-classes. The STLM consists of the conventional word-class N-gram and the sets of the independent-trained class-specific sub-word N-grams. We made an experimental language model by using STLM for the two similar proper-noun classes and performed the speech recognition experiments. The results show that any OOV word of the one class is never misrecognized as that of the other class. The results show that the STLM could integrate the multiple different statistical language models with no interference.
机译:提出了一种结构化语言模型(STLM)来应对来自多个单词类别的词汇外(OOV)单词。 STLM的目标是在不干扰的情况下对类别进行独立建模,并识别由多个单词类别引起的单词类别。 STLM由常规单词类N-gram和独立训练的特定于类的子单词N-gram的集合组成。我们通过将STLM用于两个相似的专有名词类来建立实验语言模型,并进行了语音识别实验。结果表明,一类的任何OOV单词都不会像另一类的一样被误认。结果表明,STLM可以不受干扰地集成多种不同的统计语言模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号