...
首页> 外文期刊>Acta polytechnica >A Pitch Detection Algorithm for Continuous Speech Signals Using Viterbi Traceback with Temporal Forgetting
【24h】

A Pitch Detection Algorithm for Continuous Speech Signals Using Viterbi Traceback with Temporal Forgetting

机译:基于时间遗忘的维特比回溯的连续语音信号基音检测算法

获取原文
   

获取外文期刊封面封底 >>

       

摘要

This paper presents a pitch-detection algorithm (PDA) for application to signals containing continuous speech. The core of the method is based on merged normalized forward-backward correlation (MNFBC) working in the time domain with the ability to make basic voicing decisions. In addition, the Viterbi traceback procedure is used for post-processing the MNFBC output considering the three best fundamental frequency (F0) candidates in each step. This should make the final pitch contour smoother, and should also prevent octave errors. In transition probabilities computation between F0 candidates, two major improvements were made over existing post-processing methods. Firstly, we compare pitch distance in musical cent units. Secondly, temporal forgetting is applied in order to avoid penalizing pitch jumps after prosodic pauses of one speaker or changes in pitch connected with turn-taking in dialogs. Results computed on a pitchreference database definitely show the benefit of the first improvement, but they have not yet proved any benefits of temporal modification. We assume this only happened due to the nature of the reference corpus, which had a small amount of suprasegmental content.
机译:本文提出了一种音调检测算法(PDA),适用于包含连续语音的信号。该方法的核心是基于在时域中工作的合并归一化前向后相关(MNFBC),并能够做出基本的语音决策。此外,考虑到每个步骤中的三个最佳基本频率(F0)候选者,使用维特比追溯程序对MNFBC输出进行后处理。这应该使最终音高轮廓更平滑,并且还应防止八度音阶误差。在F0候选者之间的转移概率计算中,对现有的后处理方法进行了两项重大改进。首先,我们比较以音分为单位的音高距离。其次,为了避免惩罚一个扬声器的音调暂停或与对话中的转弯有关的音高变化,应用了时间遗忘功能。在基音参考数据库上计算出的结果无疑显示了第一个改进的好处,但尚未证明临时修改有任何好处。我们认为这仅是由于参考语料库的性质而发生的,该参考语料库具有少量的超节段内容。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号