【24h】

Syllable-Level Prominence Detection with Acoustic Evidence

机译:音级音节突出检测

获取原文

摘要

Accurate prominence annotation benefits many spoken language understanding tasks as well as speech synthesis. In this work, we conduct a thorough study using acoustic prosodic cues for prominence detection in speech. This study is different from previous work in several aspects. In addition to the widely used prosodic features, such as pitch, energy, and duration, we introduce the use of cepstral features. Furthermore, we evaluate the effect of different features, speaker dependency and variation, different classifiers, and contextual information. Our experiments on the Boston University Radio News Corpus show that although the cepstral features alone do not perform well, when combined with prosodic features they yield some performance gain and, more importantly, can reduce much of the speaker variation in this task. We find that the previous context is more informative than the following context, and their combination achieves the best performance. The final result using selected features with context information is significantly better than that in previous work.
机译:准确的突出注释可以使许多口语理解任务以及语音合成受益。在这项工作中,我们使用声学韵律线索进行深入研究,以检测语音中的突出部分。这项研究与以前的工作在几个方面有所不同。除了音调,能量和持续时间等广泛使用的韵律特征外,我们还介绍了倒频谱特征的使用。此外,我们评估了不同功能,说话者依存性和变异性,不同分类器和上下文信息的影响。我们在波士顿大学广播新闻语料库上进行的实验表明,尽管单独的倒谱特征不能很好地发挥作用,但是当与韵律特征结合使用时,它们会产生一定的性能提升,更重要的是,可以减少此任务中说话者的许多变化。我们发现,前一个上下文比后一个上下文更具信息性,并且它们的组合实现了最佳性能。将所选功能与上下文信息一起使用的最终结果明显优于以前的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号