首页> 外文会议>ITG Symposium on Speech Communication >Unsupervised Classification of Voiced Speech and Pitch Tracking Using Forward-Backward Kalman Filtering
【24h】

Unsupervised Classification of Voiced Speech and Pitch Tracking Using Forward-Backward Kalman Filtering

机译:使用前向后向卡尔曼滤波的有声语音和基音跟踪的无监督分类

获取原文

摘要

The detection of voiced speech, the estimation of the fundamental frequency and the tracking of pitch values over time are crucial subtasks for a variety of speech processing techniques. Many different algorithms have been developed for each of the three subtasks. We present a new algorithm that integrates the three subtasks into a single procedure. The algorithm can be applied to pre-recorded speech utterances in the presence of considerable amounts of background noise. We combine a collection of standard metrics, such as the zero-crossing rate for example, to formulate an unsupervised voicing classifier. The estimation of pitch values is accomplished with a hybrid autocorrelation- based technique. We propose a forward-backward Kalman filter to smooth the estimated pitch contour. In experiments we are able to show that the proposed method compares favorably with current, state-of-the-art pitch detection algorithms.
机译:有声语音的检测,基频的估计以及随着时间的推移音调值的跟踪对于各种语音处理技术都是至关重要的子任务。对于这三个子任务中的每一个,已经开发了许多不同的算法。我们提出了一种将三个子任务集成到单个过程中的新算法。在存在大量背景噪声的情况下,该算法可以应用于预先录制的语音。我们结合了一系列标准指标(例如过零率)来制定无监督的语音分类器。利用基于混合自相关的技术来完成音调值的估计。我们提出了一个前向后向卡尔曼滤波器来平滑估计的音高轮廓。在实验中,我们能够证明所提出的方法与当前最新的音调检测算法相比具有优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号