首页> 外文OA文献 >Blind estimation of the number of speech source in reverberant multisource scenarios based on binaural signals
【2h】

Blind estimation of the number of speech source in reverberant multisource scenarios based on binaural signals

机译:基于双耳信号的混响多源场景中的语音源数量的盲估计

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In this paper we present a new approach for estimating the number of active speech sources in the presence of interfering noise sources and reverberation. First, a binaural front-end is used to detect the spatial positions of all active sound sources, resulting in a binary mask for each candidate position. Then, each candidate position is characterized by a set of features. In addition to exploiting the overall spectral shape, a new set of mask-based features is proposed which aims at characterizing the pattern of the estimated binary mask. The decision stage for detecting a speech source is based on a support vector machine (SVM) classifier. A systematic analysis shows that the proposed algorithm is able to blindly determine the number and the corresponding spatial positions of speech sources in multisource scenarios and generalizes well to unknown acoustic conditions
机译:在本文中,我们提出了一种新的方法,用于在存在干扰噪声源和混响的情况下估计活动语音源的数量。首先,使用双耳前端检测所有活动声源的空间位置,从而为每个候选位置生成二进制掩码。然后,每个候选位置由一组特征来表征。除了利用整体频谱形状外,还提出了一组新的基于掩模的特征,旨在表征估计的二进制掩模的图案。用于检测语音源的决策阶段基于支持向量机(SVM)分类器。系统分析表明,该算法能够在多源场景中盲目确定语音源的数量和对应的空间位置,并且可以很好地推广到未知的声学条件

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号