A METHOD AND A SYSTEM FOR DETECTING VOICE ACTIVITY BASED ON A COMPLEX GAMMA STATISTICAL MODEL

Abstract

A method and an apparatus for voice activity detection (VAD) based on a complex gamma statistical model are provided. Using the complex gamma model in place of the conventional Gaussian statistical model improves VAD performance across different noise types and signal-to-noise ratio (SNR) conditions. The method comprises the following steps: converting an input voice signal into the frequency domain by applying a fast Fourier transform (FFT); estimating the power of the noise signal from the frequency-domain signal; determining a likelihood ratio in the frequency domain for the presence and absence of speech, based on the estimated noise power and on the assumption that the discrete Fourier transform (DFT) coefficients of clean speech and of noise follow the complex gamma statistical model; and computing a decision rule for VAD from that likelihood ratio.
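The four steps in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract gives no formulas, so the sketch assumes that the real and imaginary parts of each DFT coefficient are independent two-sided gamma variables with shape 1/2, with variance λ_n under speech absence and λ_n(1 + ξ) under speech presence. Under that assumption the per-bin log-likelihood ratio is −½·ln(1 + ξ) + √(1.5/λ_n)·(1 − 1/√(1 + ξ))·(|X_R| + |X_I|). The function names, the noise estimate taken from the first frames, the crude a priori SNR estimate ξ and the threshold are all hypothetical choices made for the example.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Split x into overlapping Hann-windowed frames (one frame per row)."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx] * np.hanning(frame_len)

def gamma_log_lr(spec, noise_power, xi):
    """Per-bin log-likelihood ratio under the assumed gamma model."""
    b0 = np.sqrt(1.5 / noise_power)          # gamma rate under speech absence
    shrink = 1.0 / np.sqrt(1.0 + xi)         # ratio of the two gamma rates
    l1 = np.abs(spec.real) + np.abs(spec.imag)
    return np.log(shrink) + b0 * (1.0 - shrink) * l1

def gamma_vad(x, frame_len=256, hop=128, noise_frames=20, threshold=0.5):
    """Return (decisions, scores), one entry per frame."""
    # Step 1: transform each frame to the frequency domain.
    spec = np.fft.rfft(frame_signal(x, frame_len, hop), axis=1)
    power = np.abs(spec) ** 2
    # Step 2: noise power per bin, ASSUMING the first frames hold noise only.
    noise_power = power[:noise_frames].mean(axis=0) + 1e-12
    # Crude a priori SNR estimate (assumption; not taken from the patent).
    xi = np.maximum(power / noise_power - 1.0, 1e-3)
    # Step 3: likelihood ratio in every frequency bin.
    log_lr = gamma_log_lr(spec, noise_power, xi)
    # Step 4: decision rule, here the mean log-ratio against a threshold.
    score = log_lr.mean(axis=1)
    return score > threshold, score

# Synthetic check: one second of noise, then one second of noise plus a tone.
rng = np.random.default_rng(0)
fs = 8000
t = np.arange(2 * fs) / fs
x = 0.1 * rng.standard_normal(t.size)
x[fs:] += np.sin(2 * np.pi * 440 * t[fs:])
decisions, score = gamma_vad(x)
print(decisions[:5], decisions[-5:])
```

Averaging the log-ratios across bins is the geometric-mean rule commonly used with likelihood-ratio VADs; a practical detector would also track the noise power over time and smooth the decisions, neither of which is shown here.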

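Bibliographic details

  • Publication number: KR100718749B1
  • Patent type:
  • Publication date: 2007-05-15
  • Original format: PDF
  • Applicant/patentee:
  • Application number: KR20060118896
  • Inventor: 장준혁
  • Filing date: 2006-11-29
  • Classification: G10L19/04; G10L11/06; G10L21/02; G10L19
  • Country: KR
  • Date indexed: 2022-08-21 20:32:12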
