首页> 外国专利> Unnatural prosody detection in speech synthesis

Unnatural prosody detection in speech synthesis

机译:语音合成中的非自然韵律检测

摘要

Described is a technology by which synthesized speech generated from text is evaluated against a prosody model (trained offline) to determine whether the speech will sound unnatural. If so, the speech is regenerated with modified data. The evaluation and regeneration may be iterative until deemed natural sounding. For example, text is built into a lattice that is then (e.g., Viterbi) searched to find a best path. The sections (e.g., units) of data on the path are evaluated via a prosody model. If the evaluation deems a section to correspond to unnatural prosody, that section is replaced, e.g., by modifying/pruning the lattice and re-performing the search. Replacement may be iterative until all sections pass the evaluation. Unnatural prosody detection may be biased such that during evaluation, unnatural prosody is falsely detected at a higher rate relative to a rate at which unnatural prosody is missed.
机译:描述了一种技术,通过该技术可以根据韵律模型(离线训练)评估从文本生成的合成语音,以确定语音是否听起来不自然。如果是这样,将使用修改后的数据重新生成语音。评估和重新生成可能是迭代的,直到被认为是自然的声音为止。例如,将文本内置到一个格子中,然后搜索该格子(例如Viterbi)以找到最佳路径。经由韵律模型评估路径上的数据部分(例如,单位)。如果评估认为某节对应于不自然的韵律,则将该节替换,例如,通过修改/修剪晶格并重新执行搜索。在所有部分均通过评估之前,可能需要反复进行替换。非自然韵律检测可能有偏差,以致在评估期间,相对于遗漏非自然韵律的比率,以较高的比率错误地检测到非自然韵律。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号