Journal: IEEE Transactions on Audio, Speech, and Language Processing

An Unsupervised Approach to Cochannel Speech Separation



Abstract

Cochannel (two-talker) speech separation is predominantly addressed using pretrained speaker-dependent models. In this paper, we propose an unsupervised approach to separating cochannel speech. Our approach follows the two main stages of computational auditory scene analysis: segmentation and grouping. For voiced speech segregation, the proposed system employs a tandem algorithm for simultaneous grouping and then unsupervised clustering for sequential grouping. The clustering is performed by a search that maximizes the ratio of between-group to within-group speaker distances while penalizing within-group concurrent pitches. To segregate unvoiced speech, we first produce unvoiced speech segments based on onset/offset analysis. These segments are then grouped using the complementary binary masks of the segregated voiced speech. Despite its simplicity, our approach produces significant SNR improvements across a range of input SNRs. The proposed system performs competitively with other speaker-independent and model-based methods.
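The sequential-grouping objective in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the segment features, the pairwise-overlap representation, and the penalty weight `alpha` are all illustrative assumptions. Given per-segment speaker features and the amount of concurrent pitch between segment pairs, it exhaustively searches two-way assignments, scoring each by the ratio of between-group to within-group speaker distances minus a penalty for grouping segments whose pitch tracks overlap in time.

```python
# A hedged sketch of sequential grouping by clustering, as described in the
# abstract. Segment features, the `overlaps` dictionary, and `alpha` are
# illustrative assumptions, not the paper's actual representations.

from itertools import combinations, product
import math

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def mean_pairwise(dists):
    return sum(dists) / len(dists) if dists else 0.0

def score(assignment, feats, overlaps, alpha=1.0):
    """Ratio of between-group to within-group speaker distance, penalized
    when segments grouped together have concurrent (overlapping) pitches."""
    groups = {0: [], 1: []}
    for idx, g in enumerate(assignment):
        groups[g].append(idx)
    between = [euclidean(feats[i], feats[j])
               for i in groups[0] for j in groups[1]]
    within = [euclidean(feats[i], feats[j])
              for g in groups.values() for i, j in combinations(g, 2)]
    # Concurrent-pitch penalty: sum of overlap durations within each group.
    penalty = sum(overlaps.get((min(i, j), max(i, j)), 0.0)
                  for g in groups.values() for i, j in combinations(g, 2))
    w = mean_pairwise(within)
    ratio = mean_pairwise(between) / w if w > 0 else mean_pairwise(between)
    return ratio - alpha * penalty

def best_grouping(feats, overlaps, alpha=1.0):
    """Brute-force search over all two-way assignments; feasible for the
    small number of voiced segments in a typical cochannel utterance."""
    n = len(feats)
    best, best_s = None, -math.inf
    for assignment in product((0, 1), repeat=n):
        if len(set(assignment)) < 2:  # both talkers must be present
            continue
        s = score(assignment, feats, overlaps, alpha)
        if s > best_s:
            best, best_s = assignment, s
    return best
```

The penalty term encodes the constraint that a single talker cannot produce two pitches at once: segments with concurrent pitch tracks are discouraged from landing in the same group, even when their speaker features are close.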
