Journal: IEEE Transactions on Audio, Speech, and Language Processing

A Method for Automatic Detection of Vocal Fry

Abstract

Vocal fry (also called creak, creaky voice, and pulse register phonation) is a voice quality that carries important linguistic or paralinguistic information, depending on the language. We propose a set of acoustic measures and a method for automatically detecting vocal fry segments in speech utterances. Because the very low fundamental frequency of vocal fry segments causes problems for classic short-term analysis methods, the proposed method is synchronized to glottal pulses. The proposed acoustic measures characterize the power, aperiodicity, and similarity properties of vocal fry signals. The basic idea is to scan a “very short-term” power contour for local power peaks to obtain glottal pulse candidates, check their periodicity properties, and evaluate a similarity measure between neighboring glottal pulse candidates to decide whether they are likely to be vocal fry pulses. In the periodicity analysis, autocorrelation peak properties are taken into account to avoid misdetecting periodicity in vocal fry segments. In an evaluation of automatic detection, the proposed acoustic measures yielded 74% correct detection, with an insertion error rate of 13%.
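
The power-peak scanning and neighbor-pulse similarity steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the 4 ms window, the 6 dB peak prominence, the 100 ms maximum pulse gap, and the 0.5 similarity threshold are all assumptions chosen for illustration, and the periodicity (autocorrelation) check is omitted for brevity.

```python
import numpy as np


def very_short_term_power(x, sr, win_ms=4.0, hop_ms=2.0):
    """Frame power in dB over very short windows (window sizes are assumed)."""
    win = max(1, int(sr * win_ms / 1000))
    hop = max(1, int(sr * hop_ms / 1000))
    n_frames = max(0, 1 + (len(x) - win) // hop)
    power = np.array(
        [np.mean(x[i * hop:i * hop + win] ** 2) for i in range(n_frames)]
    )
    return 10.0 * np.log10(power + 1e-12), hop, win


def find_power_peaks(power_db, span=3, min_rise_db=6.0):
    """Frames that are local maxima and stand out from both sides.

    A frame qualifies if it is the highest within `span` frames on each side
    and exceeds the lowest frame on each side by `min_rise_db` (assumed value).
    """
    peaks = []
    for i in range(span, len(power_db) - span):
        left = power_db[i - span:i]
        right = power_db[i + 1:i + 1 + span]
        is_max = power_db[i] >= left.max() and power_db[i] > right.max()
        rises = (power_db[i] - left.min() >= min_rise_db
                 and power_db[i] - right.min() >= min_rise_db)
        if is_max and rises:
            peaks.append(i)
    return peaks


def pulse_similarity(x, c1, c2, half):
    """Peak normalized cross-correlation of snippets centered at c1 and c2."""
    if min(c1, c2) - half < 0 or max(c1, c2) + half > len(x):
        return 0.0
    a = x[c1 - half:c1 + half]
    b = x[c2 - half:c2 + half]
    denom = np.sqrt(np.sum(a ** 2) * np.sum(b ** 2))
    if denom == 0.0:
        return 0.0
    return float(np.max(np.correlate(a, b, mode="full")) / denom)


def detect_fry_pulses(x, sr, max_gap_ms=100.0, min_sim=0.5):
    """Sample positions of candidate pulses that resemble a close neighbor."""
    power_db, hop, win = very_short_term_power(x, sr)
    # Convert peak frame indices to sample positions at the frame centers.
    centers = [p * hop + win // 2 for p in find_power_peaks(power_db)]
    half = int(sr * 0.005)  # 5 ms on each side of a candidate (assumed)
    max_gap = int(sr * max_gap_ms / 1000)
    fry = set()
    for c1, c2 in zip(centers, centers[1:]):
        close_enough = c2 - c1 <= max_gap
        if close_enough and pulse_similarity(x, c1, c2, half) >= min_sim:
            fry.update((c1, c2))
    return sorted(fry)
```

Calling `detect_fry_pulses(signal, sample_rate)` on a mono floating-point waveform returns the sample positions of pulse candidates that passed both steps. Taking the maximum of the cross-correlation over all lags makes the similarity measure tolerant of small misalignments between candidate positions, which is one plausible way to compare pulses whose peaks were located on a coarse frame grid.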