首页> 外国专利> TARGET SPEECH ESTIMATION MODEL LEARNING DEVICE, TARGET SPEECH DETERMINATION DEVICE, TARGET SPEECH ESTIMATION MODEL LEARNING METHOD, TARGET SPEECH DETERMINATION METHOD AND PROGRAM

TARGET SPEECH ESTIMATION MODEL LEARNING DEVICE, TARGET SPEECH DETERMINATION DEVICE, TARGET SPEECH ESTIMATION MODEL LEARNING METHOD, TARGET SPEECH DETERMINATION METHOD AND PROGRAM

机译：目标言语估计模型学习装置，目标言语确定装置，目标言语估计模型学习方法，目标言语确定方法和程序

页面导航

摘要
著录项
相似文献

摘要

To provide technology for determining whether uttered voice detected from input voice is that suitable for a prescribed target.SOLUTION: A target speech estimation model learning device comprises: a speech detection section for detecting uttered voice corresponding to voice which a speaker utters and extracting an acoustic feature of the uttered voice from input voice including the voice which the speaker utters and noise; a voice recognition section for generating a voice recognition result set with recognition score from the uttered voice; a vector expression generation section for generating a voice recognition result word vector expression set, and a voice recognition result part-of-speech vector expression set from the voice recognition result set with recognition score; and a target speech determination section for outputting the uttered voice and the voice recognition result set with recognition score when the uttered voice is determined to be the speech suitable for a prescribed target from the uttered voice, the acoustic feature, the voice recognition result set with recognition score, the voice recognition result word vector expression set and the voice recognition result part-of-speech vector expression set by using a target speech estimation model for outputting probability that the uttered voice detected from the input voice is the speech suitable for the prescribed target.SELECTED DRAWING: Figure 3

机译：提供一种确定从输入语音中检测出的发音是否适合于预定目标的技术。解决方案：目标语音估计模型学习设备包括：语音检测部分，用于检测与说话者说出的语音相对应的发音并提取声学输入语音中发出的语音的特征，包括扬声器发出的语音和噪音;语音识别部分，用于从发出的语音中生成具有识别分数的语音识别结果集;向量表达生成部分，用于生成语音识别结果词矢量表达集合，以及从具有识别分数的语音识别结果集中的语音识别结果词性矢量表达集合;目标语音确定部分，用于当从所述语音，声学特征，所述语音识别结果集确定所述语音为适合于预定目标的语音时，输出所述语音和具有识别分数的语音识别结果集。识别分数，语音识别结果词向量表达集和语音识别结果词性向量表达集通过使用目标语音估计模型来输出从输入语音中检测到的发声语音是适合于规定语音的概率target.SELECTED DRAWING：图3

著录项

公开/公告号JP2019139000A

专利类型
公开/公告日2019-08-22

原文格式PDF
申请/专利权人 NIPPON TELEGR & TELEPH CORP NTT;
展开▼

申请/专利号JP20180020773
发明设计人 NAKAMURA TAKASHI;FUKUTOMI TAKAAKI;
展开▼

申请日2018-02-08
分类号G10L15/22;G10L15/16;G10L15/06;
国家 JP
入库时间 2022-08-21 12:24:12

相似文献

专利
外文文献
中文文献