首页> 外文学位 >Pitch tracking and speech enhancement in noisy and reverberant environments.
【24h】

Pitch tracking and speech enhancement in noisy and reverberant environments.

机译:在嘈杂和混响环境中的音调跟踪和语音增强。

获取原文
获取原文并翻译 | 示例

摘要

Two causes of speech degradation exist in practically all listening situations: noise interference and room reverberation. This dissertation investigates three particular aspects of speech processing in noisy and reverberant environments: multipitch tracking for noisy speech, measurement of reverberation time based on pitch strength, and reverberant speech enhancement using one microphone (or monaurally).; An effective multipitch tracking algorithm for noisy speech is critical for speech analysis and processing. However, the performance of existing algorithms is not satisfactory. We present a robust algorithm for multipitch tracking of noisy speech. Our approach integrates an improved channel and peak selection method, a new method for extracting periodicity information across different channels, and a hidden Markov model (HMM) for forming continuous pitch tracks. The resulting algorithm can reliably track single and double pitch tracks in a noisy environment. We suggest a pitch error measure for the multipitch situation. The proposed algorithm is evaluated on a database of speech utterances mixed with various types of interference. Quantitative comparisons show that our algorithm significantly outperforms existing ones.; Reverberation corrupts harmonic structure in voiced speech. We observe that the pitch strength of voiced speech segments is indicative of the degree of reverberation. Consequently, we present a pitch-based measure for reverberation time (T60) utilizing our new pitch determination algorithm. The pitch strength is measured by deriving the statistics of relative time lags, defined as the distances from the detected pitch periods to the closest peaks in correlograms. The monotonic relationship between the measured pitch strength and reverberation time is learned from a corpus of reverberant speech with known reverberation times.; Under noise-free conditions, the quality of reverberant speech is dependent on two distinct perceptual components: coloration and long-term reverberation. They correspond to two physical variables: signal-to-reverberant energy ratio (SRR) and reverberation time, respectively. We propose a two-stage reverberant speech enhancement algorithm using one microphone. In the first stage, an inverse filter is estimated to reduce coloration effects so that SRR is increased. The second stage utilizes spectral subtraction to minimize the influence of long-term reverberation. The proposed algorithm significantly improves the quality of reverberant speech. Our algorithm is quantitatively compared with a recent one-microphone reverberant speech enhancement algorithm on a corpus of speech utterances in a number of reverberant conditions. The results show that our algorithm performs substantially better.
机译:在几乎所有聆听情况下,语音质量下降的两个原因都存在:噪声干扰和房间混响。本文研究了在嘈杂和混响环境中语音处理的三个特定方面:对嘈杂语音进行多音高跟踪,基于音调强度的混响时间测量以及使用一个麦克风(或单声道)进行混响语音增强。有效的多音高语音跟踪算法对于语音分析和处理至关重要。但是,现有算法的性能不能令人满意。我们提出了一种用于嘈杂语音的多音高跟踪的鲁棒算法。我们的方法集成了改进的通道和峰值选择方法,跨不同通道提取周期性信息的新方法以及用于形成连续音高轨道的隐藏马尔可夫模型(HMM)。所得算法可以在嘈杂的环境中可靠地跟踪单音节和双音节轨道。我们建议针对多音高情况使用音高误差度量。所提出的算法在混合了各种干扰的语音话语数据库上进行评估。定量比较表明,我们的算法明显优于现有算法。混响破坏了浊音中的谐波结构。我们观察到浊音段的音调强度指示混响的程度。因此,我们使用新的音高确定算法提出了一种基于音高的混响时间测量(T 60 )。通过获得相对时滞的统计数据来测量音调强度,该相对时滞定义为从检测到的音调周期到相关图中最接近的峰值的距离。从具有已知混响时间的混响语音语料库获知测得的音高强度与混响时间之间的单调关系。在无噪声的情况下,混响语音的质量取决于两个明显的感知成分:着色和长期混响。它们分别对应两个物理变量:信号与混响能量比(SRR)和混响时间。我们提出了一种使用一个麦克风的两阶段混响语音增强算法。在第一阶段,估计逆滤波器以减少着色效果,从而增加SRR。第二阶段利用频谱减法来最小化长期混响的影响。所提出的算法大大提高了混响语音的质量。在许多混响条件下,我们的算法与最近的单麦克风混响语音增强算法在语音言语语料库上进行了定量比较。结果表明,我们的算法性能明显更好。

著录项

  • 作者

    Wu, Mingyang.;

  • 作者单位

    The Ohio State University.;

  • 授予单位 The Ohio State University.;
  • 学科 Computer Science.; Artificial Intelligence.
  • 学位 Ph.D.
  • 年度 2003
  • 页码 166 p.
  • 总页数 166
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;人工智能理论;
  • 关键词

  • 入库时间 2022-08-17 11:45:40

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号