首页> 外国专利> EVALUATING SPEECH INTELLIGIBILITY OF TEXT-TO-SPEECH SYNTHESIS USING TEMPLATECONSTRAINED GENERALIZED POSTERIOR PROBABILITY

EVALUATING SPEECH INTELLIGIBILITY OF TEXT-TO-SPEECH SYNTHESIS USING TEMPLATECONSTRAINED GENERALIZED POSTERIOR PROBABILITY

机译:使用模板|广义广义概率估计文本到语音合成的语音可理解性

摘要

Instead of relying on humans to subjectively evaluate speech intelligibility of a subject, a system objectively evaluates the speech intelligibility. The system receives speech input and calculates confidence scores at multiple different levels using a Template Constrained Generalized Posterior Probability algorithm. One or multiple intelligibility classifiers are utilized to classify the desired entities on an intelligibility scale. A specific intelligibility classifier utilizes features such as the various confidence scores. The scale of the intelligibility classification can be adjusted to suit the application scenario. Based on the confidence score distributions and the intelligibility classification results at multiple levels an overall objective intelligibility score is calculated. The objective intelligibility scores can be used to rank different subjects or systems being assessed according to their intelligibility levels. The speech that is below a predetermined intelligibility (e.g. utterances with low confidence scores and most severe intelligibility issues) can be automatically selected for further analysis.
机译:系统不依赖于人类来主观地评估对象的语音清晰度,而是客观地评估语音清晰度。该系统使用模板约束广义后验概率算法接收语音输入并计算多个不同级别的置信度得分。一个或多个清晰度分类器被用于以清晰度等级对期望的实体进行分类。特定的清晰度分类器利用各种置信度得分等功能。清晰度等级的大小可以根据应用场景进行调整。基于置信度分数分布和多个级别的清晰度分类结果,可以计算出总体客观清晰度分数。客观清晰度分数可用于根据其清晰度水平对要评估的不同主题或系统进行排名。可以自动选择低于预定清晰度的语音(例如具有低置信度分数和最严重的清晰度问题的话语)以进行进一步分析。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号