首页> 外文期刊>IEICE transactions on information and systems >Robust Voice Activity Detection Algorithm Based on Feature of Frequency Modulation of Harmonics and Its DSP Implementation
【24h】

Robust Voice Activity Detection Algorithm Based on Feature of Frequency Modulation of Harmonics and Its DSP Implementation

机译:基于谐波调频特性的鲁棒语音活动检测算法及其DSP实现

获取原文
       

摘要

This paper proposes a voice activity detection (VAD) algorithm based on an energy related feature of the frequency modulation of harmonics. A multi-resolution spectro-temporal analysis framework, which was developed to extract texture features of the audio signal from its Fourier spectrogram, is used to extract frequency modulation features of the speech signal. The proposed algorithm labels the voice active segments of the speech signal by comparing the energy related feature of the frequency modulation of harmonics with a threshold. Then, the proposed VAD is implemented on one of Texas Instruments (TI) digital signal processor (DSP) platforms for real-time operation. Simulations conducted on the DSP platform demonstrate the proposed VAD performs significantly better than three standard VADs, ITU-T G.729B, ETSI AMR1 and AMR2, in non-stationary noise in terms of the receiver operating characteristic (ROC) curves and the recognition rates from a practical distributed speech recognition (DSR) system.
机译:本文提出了一种基于谐波频率调制的能量相关特征的语音活动检测(VAD)算法。开发了一种多分辨率频谱时态分析框架,该框架用于从其傅立叶频谱图中提取音频信号的纹理特征,用于提取语音信号的频率调制特征。所提出的算法通过将谐波频率调制的能量相关特征与阈值进行比较,来标记语音信号的语音活动段。然后,建议的VAD在Texas Instruments(TI)数字信号处理器(DSP)平台之一上实现,以实现实时操作。在DSP平台上进行的仿真表明,就接收机工作特性(ROC)曲线和识别率而言,非平稳噪声方面,拟议的VAD的性能明显优于ITU-T G.729B,ETSI AMR1和AMR2三个标准VAD。来自实用的分布式语音识别(DSR)系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号