Speech Separation of Multiple Moving Speakers Using Multisensor Multitarget Techniques

Potamitis I.; Kokkinakis G.

首页> 外文期刊>IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans >Speech Separation of Multiple Moving Speakers Using Multisensor Multitarget Techniques

【24h】

Speech Separation of Multiple Moving Speakers Using Multisensor Multitarget Techniques

机译：使用多传感器多目标技术的多个移动扬声器的语音分离

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The general problem addressed in this paper is that of separating the voices of active moving speakers in the presence of background noise and moderate reverberation level in the acoustic field using a single microphone array. We adapt the multisensor multitarget tracking theory to the context of microphone arrays in order to form receptive beams that lock on each moving speaker on an extended time basis and therefore, achieve voice separation. Our approach: 1) incorporates kinematical information of speakers' movement by using an interacting multiple model (IMM) estimator per speaker in order to constrain the evolution of direction of arrival (DOA) measurements, which characterize various motions of the speakers, and 2) can directly account for measurement origin uncertainty, i.e., which measurement comes from which speaker, by using the probabilistic-data-association technique in conjunction with the IMM estimator. The effectiveness of the approach is illustrated by an extensive simulation study on tracking the DOAs of two speakers with crossing trajectories and three static speakers having a conversation with partially overlapping speech and long pauses

机译：本文解决的一般问题是使用单个麦克风阵列在存在背景噪声和适度混响水平的情况下在声场中分离主动移动扬声器的声音。我们将多传感器多目标跟踪理论应用于麦克风阵列的环境，以便形成可以在延长的时间基础上锁定在每个移动扬声器上的接收束，从而实现语音分离。我们的方法：1）通过使用每个发言者的交互多模型（IMM）估计器来合并发言者运动的运动学信息，以限制到达方向（DOA）测量的演变，以表征发言者的各种动作，以及2）通过结合使用概率数据关联技术和IMM估计器，可以直接考虑测量起点的不确定性，即哪个测量来自哪个说话者。该方法的有效性通过广泛的模拟研究得到了证明，该研究跟踪了具有交叉轨迹的两个说话者和三个静态说话者的对话的DOA，对话中的会话具有部分重叠的语音和长时间的停顿

著录项

来源
《IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans》 |2007年第2007期|p.72-81|共10页
作者
Potamitis I.; Kokkinakis G.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化基础理论;
关键词
direction-of-arrival estimation; microphone arrays; sensor fusion; speech processing; target tracking; direction of arrival measurements; multiple moving speakers; multisensor multitarget techniques; probabilistic-data-association technique; single microphone arr;

机译：到达方向估计;麦克风阵列;传感器融合;语音处理;目标跟踪;到达方向测量;多个移动扬声器;多传感器多目标技术;概率数据关联技术;单个麦克风arr;

相似文献

外文文献
中文文献
专利

1. Speech Separation of Multiple Moving Speakers Using Multisensor Multitarget Techniques [J] . Potamitis I., Kokkinakis G. IEEE transactions on systems, man, and cybernetics. Part A, Systems and humans . 2007,第1期

机译：使用多传感器多目标技术的多个移动扬声器的语音分离
2. Multitarget-Multisensor Tracking: Principles and Techniques [Book Review] [J] . Daum F. IEEE Aerospace and Electronic Systems Magazine . 1996,第2期

机译：多目标多传感器跟踪：原理和技术[书评]
3. Multitarget-Multisensor Tracking: Principles and Techniques [BOOKSHELF] [J] . IEEE Control Systems Magazine . 1996,第1期

机译：多目标多传感器跟踪：原理和技术[书架]
4. Blind Speech Separation of Moving Speakers Using Hybrid Neural Networks [C] . Athanasios Koutras, Evangelos Dermatas, George Kokkinakis European conference on speech communication and technology . 2001

机译：使用混合神经网络的移动扬声器的盲目分离
5. Optimum techniques in multisensor multitarget tracking and track association. [D] . Giannopoulos, Evangelos H. 1999

机译：多传感器多目标跟踪和跟踪关联中的最佳技术。
6. Long short-term memory for speaker generalization in supervised speech separation [O] . Jitong Chen, DeLiang Wang -1

机译：长时短时记忆用于监督语音分离中的说话人泛化
7. Blind Speech Separation Of Moving Speakers In Real Reverberant Environments [O] . A. Koutras, E. Dermatas, G. Kokkinakis 2007

机译：真实混响环境中移动扬声器的盲语音分离
8. Transcription of Multiple Speakers Using Speaker Dependent Speech Recognition [R] . 2003

机译：使用说话人相关语音识别转录多个扬声器

Speech Separation of Multiple Moving Speakers Using Multisensor Multitarget Techniques

摘要

著录项

相似文献

相关主题

期刊订阅