Binaural source separation based on spatial cues and maximum likelihood model adaptation

Abdipour Roohollah; Akbari Ahmad; Rahmani Mohsen; Nasersharif Babak

Abstract

This paper describes a system for separating multiple moving sound sources from two-channel recordings based on spatial cues and a model adaptation technique. We employ a statistical model of observed interaural level and phase differences, where maximum likelihood estimation of model parameters is achieved through an expectation-maximization algorithm. This model is used to partition spectrogram points into several clusters (one cluster per source) and generate spectrogram masks accordingly for isolating individual sound sources. We follow a maximum likelihood linear regression (MLLR) approach for tracking source relocations and adapting model parameters accordingly. The proposed algorithm is able to separate more sources than input channels, i.e., in the underdetermined setting. In simulated anechoic and reverberant environments with two and three speakers, the proposed model-adaptation algorithm yields a gain of more than 10 dB in signal-to-noise ratio improvement for azimuthal source relocations of 15 degrees or more. Moreover, this performance gain is achievable with only 0.6 seconds of input mixture received after relocation. (C) 2014 Elsevier Inc. All rights reserved.
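The clustering step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' model: it assumes each source produces roughly Gaussian interaural level difference (ILD) and interaural phase difference (IPD) values that do not depend on frequency, and it ignores phase wrapping. The function names, the quantile-based initialisation, and the synthetic cue centres are all hypothetical choices made for this illustration.

```python
import cmath
import math
import random


def interaural_cues(left, right, eps=1e-12):
    """ILD (dB) and IPD (radians) for one pair of complex STFT coefficients."""
    ild = 20.0 * math.log10((abs(left) + eps) / (abs(right) + eps))
    ipd = cmath.phase(left * right.conjugate())  # lies in [-pi, pi]
    return ild, ipd


def log_gauss(x, mean, var):
    """Log-density of a one-dimensional Gaussian."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def em_soft_masks(cues, n_sources, n_iter=50, var_floor=1e-3):
    """Fit a diagonal-covariance Gaussian mixture to (ILD, IPD) pairs with EM.

    Returns (masks, means); masks[i][k] is the posterior probability that
    spectrogram point i belongs to source k.
    """
    n = len(cues)
    dims = range(2)
    # Assumed initialisation: means at evenly spaced quantiles of the ILD.
    by_ild = sorted(cues)
    means = [list(by_ild[(2 * k + 1) * n // (2 * n_sources)])
             for k in range(n_sources)]
    global_var = []
    for d in dims:
        mu = sum(c[d] for c in cues) / n
        global_var.append(max(sum((c[d] - mu) ** 2 for c in cues) / n, var_floor))
    variances = [list(global_var) for _ in range(n_sources)]
    weights = [1.0 / n_sources] * n_sources
    masks = []
    for _ in range(n_iter):
        # E-step: posterior of each source for every spectrogram point.
        masks = []
        for c in cues:
            logp = [math.log(weights[k])
                    + sum(log_gauss(c[d], means[k][d], variances[k][d]) for d in dims)
                    for k in range(n_sources)]
            top = max(logp)
            p = [math.exp(v - top) for v in logp]
            total = sum(p)
            masks.append([v / total for v in p])
        # M-step: maximum likelihood update of weights, means and variances.
        for k in range(n_sources):
            nk = sum(m[k] for m in masks) + 1e-12
            weights[k] = nk / n
            for d in dims:
                means[k][d] = sum(m[k] * c[d] for m, c in zip(masks, cues)) / nk
                variances[k][d] = max(
                    sum(m[k] * (c[d] - means[k][d]) ** 2
                        for m, c in zip(masks, cues)) / nk,
                    var_floor)
    return masks, means


def synthetic_cues(n_per_source=200, seed=1):
    """Hypothetical test data: two sources with made-up ILD/IPD centres."""
    rng = random.Random(seed)
    centres = [(6.0, 0.8), (-6.0, -0.8)]
    return [(rng.gauss(ild, 1.0), rng.gauss(ipd, 0.2))
            for ild, ipd in centres for _ in range(n_per_source)]
```

In a real pipeline the cue pairs would come from applying interaural_cues to every time-frequency point of the left and right spectrograms, and each column of the returned masks would be reshaped to the spectrogram grid and multiplied with the mixture to isolate one source. The MLLR adaptation step for moving sources is not covered by this sketch.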

Bibliographic record

Source: Digital Signal Processing, 2015, 10 pages
Authors: Abdipour Roohollah; Akbari Ahmad; Rahmani Mohsen; Nasersharif Babak
Original format: PDF
Language: English
Chinese Library Classification: Digital signal processing
Keywords: Binaural source separation; Model adaptation; Maximum likelihood linear regression; Statistical signal processing; Speech enhancement

Similar literature

1. Binaural source separation based on spatial cues and maximum likelihood model adaptation [J]. Abdipour Roohollah, Akbari Ahmad, Rahmani Mohsen. Digital Signal Processing, 2015.
2. Clustering of spatial cues by semantic segmentation for anechoic binaural source separation [J]. Gul Sania, Fulaly Muhammad Sheryar, Khan Muhammad Salman. Applied Acoustics, 2021.
3. Two dimentional DOA estimation of sound sources based on the binaural model and its application on concurrent speech separation [J]. Tsuyoshi Usagawa, Takashi Nakanishi, Hidetoshi Nakashima. 電子情報通信学会技術研究報告. 信号処理. Signal Processing, 2003, no. 55.
4. Source separation based on binaural cues and source model constraints [C]. Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis. International Speech Communication Association, 2008.
5. Binaural model-based source separation and localization [D]. Mandel, Michael I. 2010.
6. Blind Source Separation Method Based on Neural Network with Bias Term and Maximum Likelihood Estimation Criterion [O]. Sheng Liu, Bangmin Wang, Lanyong Zhang. 2021.
7. Source Separation Based on Binaural Cues and Source Model Constraints [O]. Weiss Ron J., Mandel Michael I., Ellis Daniel P. W. 2009.
