An Adaptive Non Reference Anchor Array Framework for Audio Retrieval in Teleconferencing Environment

Karan Nathwani; Arpit Shukla; Shubham Khunteta; Rajesh M. Hegde


Abstract

This paper proposes an adaptive framework for audio retrieval in live teleconferencing environments with multiple participants. The framework uses a non reference anchor array (NRA) to capture the interfering speech sources, in addition to the primary array that captures the speech source of interest (SOI). A linearly constrained minimum variance (LCMV) beamformer preserves the signal arriving from the look direction while nulling interference arriving from non-look directions. The reverberant component of the acquired speech is then removed by a novel method that uses the linear prediction (LP) residual cepstrum. This method does not require computing the acoustic impulse response (AIR) of the teleconferencing room and is therefore computationally efficient. The NRA framework is thus able both to remove correlated noise arriving from the direction of the SOI and to dereverberate the noise-free signal. The performance of the proposed framework is evaluated through experiments on clean speech acquisition from distant microphone arrays. Experiments on distant speech recognition are also conducted using the TIMIT and MONC databases. The experimental results indicate a reasonable improvement over correlation-based, subspace-based, and standard minimum variance beamforming methods. The application of the framework to audio retrieval in a live teleconferencing environment with multiple participants is also discussed.
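
The look-direction constraint described in the abstract can be illustrated with the textbook closed-form LCMV solution, w = R^-1 C (C^H R^-1 C)^-1 f. The sketch below is not the authors' implementation: it assumes a narrowband, far-field, uniform linear array, and the array size, element spacing, source angles, and covariance values are all hypothetical choices made for illustration. The function names `steering_vector` and `lcmv_weights` are likewise invented here.

```python
import numpy as np

def steering_vector(n_mics, spacing_wavelengths, angle_deg):
    """Narrowband far-field steering vector of a uniform linear array (assumed geometry)."""
    n = np.arange(n_mics)
    phase = 2j * np.pi * spacing_wavelengths * n * np.sin(np.deg2rad(angle_deg))
    return np.exp(phase)

def lcmv_weights(R, C, f):
    """Textbook LCMV weights: minimise w^H R w subject to C^H w = f."""
    Rinv_C = np.linalg.solve(R, C)  # R^-1 C without forming the inverse
    return Rinv_C @ np.linalg.solve(C.conj().T @ Rinv_C, f)

n_mics = 8                                    # hypothetical array size
a_look = steering_vector(n_mics, 0.5, 0.0)    # hypothetical SOI at broadside
a_intf = steering_vector(n_mics, 0.5, 40.0)   # hypothetical interferer at 40 degrees

# Illustrative covariance: one strong interferer plus white sensor noise.
R = 10.0 * np.outer(a_intf, a_intf.conj()) + 0.1 * np.eye(n_mics)

# Constraints: unit gain toward the look direction, a null toward the interferer.
C = np.column_stack([a_look, a_intf])
f = np.array([1.0, 0.0])
w = lcmv_weights(R, C, f)

gain_look = abs(w.conj() @ a_look)   # magnitude response toward the SOI
gain_intf = abs(w.conj() @ a_intf)   # magnitude response toward the interferer
print(f"look-direction gain: {gain_look:.3f}, interferer gain: {gain_intf:.2e}")
```

In this sketch the interferer direction is supplied by hand. In the framework described above, the NRA is what captures the interfering sources, and the abstract does not say how that information enters the beamformer.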


Bibliographic details

Source
Journal of signal processing systems for signal, image, and video technology, 2014, Issue 1, pp. 91-102 (12 pages)
Authors
Karan Nathwani; Arpit Shukla; Shubham Khunteta; Rajesh M. Hegde
Affiliations

Department of Electrical Engineering, Indian Institute of Technology, Kanpur 16, India (listed for each of the four authors)

Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA)
Original format: PDF
Language: English
Keywords
Non reference anchor array; LP residual cepstrum; Adaptive beamforming;


