首页> 外国专利> The silent mixed normal distribution model and the following mixed normal distribution model which are formed beforehand in every

The silent mixed normal distribution model and the following mixed normal distribution model which are formed beforehand in every

机译:每种情况下预先形成的静默混合正态分布模型和以下混合正态分布模型

摘要

The processing efficiency and estimation accuracy of a voice activity detection apparatus are improved. An acoustic signal analyzer receives a digital acoustic signal containing a speech signal and a noise signal, generates a non-speech GMM and a speech GMM adapted to a noise environment, by using a silence GMM and a clean-speech GMM in each frame of the digital acoustic signal, and calculates the output probabilities of dominant Gaussian distributions of the GMMs. A speech state probability to non-speech state probability ratio calculator calculates a speech state probability to non-speech state probability ratio based on a state transition model of a speech state and a non-speech state, by using the output probabilities; and a voice activity detection unit judges, from the speech state probability to non-speech state probability ratio, whether the acoustic signal in the frame is in the speech state or in the non-speech state and outputs only the acoustic signal in the speech state.
机译:语音活动检测设备的处理效率和估计精度得以提高。声学信号分析仪接收包含语音信号和噪声信号的数字声学信号,并通过在语音帧的每个帧中使用静音GMM和纯语音GMM来生成适合语音环境的非语音GMM和语音GMM。数字声信号,并计算GMM的主要高斯分布的输出概率。语音状态概率到非语音状态概率比计算器基于语音状态和非语音状态的状态转移模型,通过使用输出概率来计算语音状态概率到非语音状态概率比。语音活动检测单元根据语音状态概率与非语音状态概率之比,判断帧中的声音信号处于语音状态还是非语音状态,仅输出语音状态的声音信号。

著录项

  • 公开/公告号JP5411936B2

    专利类型

  • 公开/公告日2014-02-12

    原文格式PDF

  • 申请/专利权人 日本電信電話株式会社;

    申请/专利号JP20110523623

  • 发明设计人 中谷 智広;藤本 雅清;

    申请日2010-07-15

  • 分类号G10L15/04;G10L15/20;G10L21/0308;

  • 国家 JP

  • 入库时间 2022-08-21 16:14:34

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号