Group delay based methods for recognition of distant talking speech

机译：基于群时延的远距离语音识别方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The group delay function has been used conventionally in temporal spectral analysis and feature extraction for speech recognition. In this work we present a detailed analysis of a novel approach to spatial spectral analysis of speech using the MUSIC-Group delay spectrum. In our previous work we have proposed the use of the MUSIC-Group delay spectrum [ICASSP 2010], for direction of arrival estimation (DOA) and distant speech recognition. We discuss the advantages of the proposed method in terms of resolving closely spaced speech sources with minimal number of sensors. This method is also analyzed from a minimum phase perspective as is done in temporal processing of speech. Additional analysis is performed using the Pisarenko-Group delay spectrum in terms of real time performance. DOAs estimated from the proposed approach are used to train filter and sum beamformers. Distant speech recognition experiments in clean and reverberant conditions using the beamformed speech signal indicate reasonable improvements over correlation and sub space based methods.

机译：群体延迟功能通常已用于时间频谱分析和特征提取以进行语音识别。在这项工作中，我们介绍了一种使用MUSIC-Group延迟频谱进行语音空间频谱分析的新方法的详细分析。在我们之前的工作中，我们已经建议将MUSIC组延迟频谱[ICASSP 2010]用于到达方向估计（DOA）和远距离语音识别。我们讨论了用最小数量的传感器来解决紧密间隔的语音源方面所提出的方法的优点。还从最小相位角度分析此方法，就像在语音的时间处理中所做的一样。就实时性能而言，使用Pisarenko-Group延迟频谱进行了其他分析。从提出的方法估计的DOA用于训练滤波器和求和波束形成器。使用波束赋形的语音信号在干净和混响条件下进行的远距离语音识别实验表明，在基于相关性和子空间的方法上有合理的改进。

著录项

来源
《Conference Record of the Forty Fourth Asilomar Conference on Signals, Systems and Computers》|2010年|p.1702-1706|共5页
会议地点
作者
Mandala Rohan; Shukla Mrityunjaya; Hegde Rajesh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信理论;
关键词

相似文献

外文文献
中文文献
专利

1. Simultaneous Recognition of Distant-Talking Speech of Multiple Talkers Based on the 3-D N-Best Search Method [J] . PANIKOS HERACLEOUS, SATOSHI NAKAMURA, KIYOHIRO SHIKANO Journal of VLSI signal processing . 2004,第2a3期

机译：基于3-D N-最佳搜索方法的多个通话者远距离语音的同时识别
2. Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm [J] . Longbiao WANG, Norihide KITAOKA, Seiichi NAKAGAWA IEICE Transactions on Information and Systems . 2011,第3期

机译：基于多通道LMS算法的谱相减的远距离语音识别
3. Distant-Talking Speech Recognition Based on Spectral Subtraction by Multi-Channel LMS Algorithm [J] . Longbiao WANG, Norihide KITAOKA, Seiichi NAKAGAWA IEICE transactions on information and systems . 2011,第3期

机译：基于多通道LMS算法的频谱减法的远程谈话语音识别
4. Group delay based methods for recognition of distant talking speech [C] . Mandala Rohan, Shukla Mrityunjaya, Hegde Rajesh Conference Record of the Forty-Third Asilomar Conference on Signals, Systems and Computers . 2010

机译：基于群延迟识别遥远谈话演讲的方法
5. Robust Acoustic Modeling and Front-End Design for Distant Speech Recognition [D] . Mirsamadi, Seyedmahdad. 2017

机译：鲁棒的声学建模和远端语音识别前端设计
6. Multi-Talker Speech Promotes Greater Knowledge-Based Spoken Mandarin Word Recognition in First and Second Language Listeners [O] . Seth Wiener, Chao-Yang Lee 2020

机译：多语种语音在第一语言和第二语言听众中促进基于知识的口语普通话单词识别
7. MODEL-BASED DEREVERBERATION IN THE LOGMELSPEC DOMAIN FOR ROBUST DISTANT-TALKING SPEECH RECOGNITION [O] . Armin Sehr, Walter Kellermann 2011

机译：LOGMELSPEC域中基于模型的去耦，用于鲁棒远程语音识别

Group delay based methods for recognition of distant talking speech

摘要

著录项

相似文献

相关主题

期刊订阅