...
首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Out-of-vocabulary word modeling by using sub-word units
【24h】

Out-of-vocabulary word modeling by using sub-word units

机译:使用子字单元的词汇形状建模

获取原文
获取原文并翻译 | 示例
           

摘要

A structured language model (STLM) is proposed to cope with out-of-vocabulary (OOV) words coming from multiple word-classes. The STLM aims at independently modeling the classes without interference and identifying the class of words arising from multiple word-classes. The STLM consists of the conventional word-class N-gram and the sets of the independent-trained class-specific sub-word N-grams. We made an experimental language model by using STLM for the two similar proper-noun classes and performed the speech recognition experiments. The results show that any OOV word of the one class is never misrecognized as that of the other class. The results show that the STLM could integrate the multiple different statistical language models with no interference.
机译:提出了一种结构化语言模型(STLM)以应对来自多个单类类的词汇流(OOV)单词。 STLM旨在独立地建模类而不干扰,并识别来自多个单个类别的单词类。 STLM由传统的单词N-GRAM和独立培训的类特定类子字N-gram组成。 我们通过使用STLM为两个类似的正确名词类进行了实验语言模型,并执行了语音识别实验。 结果表明,任何一类的任何OOV字永远不会被误导为另一类类别。 结果表明,STLM可以整合多种不同的统计语言模型,不会干扰。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号