首页> 外国专利> Spoken language corpus generation device and program thereof

Spoken language corpus generation device and program thereof

机译:口语语料库生成装置及其程序

摘要

PROBLEM TO BE SOLVED: To provide a speech language corpus generation device that generates a speech language corpus for learning an acoustic model used in speech recognition of a specific program.SOLUTION: A speech language corpus generation device 1 includes: mismatch probability calculation means 12 being associated with a corresponding pattern between a subtitle text and the recognition hypotheses from the recognition hypotheses recognizing speech of a specific program, the subtitle text and an opening line, and calculating mismatch probability between the subtitle text and the opening line; and corpus selection means 42 for calculating an error rate of a corpus candidate subtitle text by the mismatch probability associated the corresponding pattern between the corpus candidate recognition hypotheses obtained by speech recognition of the corpus candidate program speech serving as a candidate of a speech language corpus and the corpus candidate subtitle text, and selecting the corpus candidate program speech of an utterance section having the error rate of a threshold or less and the corpus candidate subtitle text as the speech language corpus.SELECTED DRAWING: Figure 1
机译:解决的问题:提供一种生成用于学习特定节目的语音识别中使用的声学模型的语音语言语料库的语音语言语料库生成装置。解决方案:语音语言语料库生成装置1包括:失配概率计算装置12,其为关联于字幕文本和识别假设之间的对应模式,所述识别假设基于识别特定节目的语音,字幕文本和开头行的识别假设,并计算字幕文本和开头行之间的不匹配概率;语料库选择装置42,用于通过与作为语音语言语料库的候选者的语料库候选节目语音的语音识别而获得的语料库候选识别假设之间的对应模式相关联的不匹配概率,来计算语料库候选字幕文本的错误率。语料库候选字幕文本,并选择错误率小于或等于阈值的发声部分的语料库候选程序语音,并将语料库候选字幕文本作为语音语言语料库。选图:图1

著录项

  • 公开/公告号JP6637332B2

    专利类型

  • 公开/公告日2020-01-29

    原文格式PDF

  • 申请/专利权人 日本放送協会;

    申请/专利号JP20160031925

  • 发明设计人 奥 貴裕;萩原 愛子;佐藤 庄衛;

    申请日2016-02-23

  • 分类号G10L15/06;

  • 国家 JP

  • 入库时间 2022-08-21 11:32:29

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号