Real-time processing using the frequency domain binaural model

Yoshifumi Chisaki; Kotaro Matsuo; Katsumori Hagiwara; Hidetoshi Nakashima; Tsuyoshi Usagawa

首页> 外文期刊>Applied Acoustics >Real-time processing using the frequency domain binaural model

【24h】

Real-time processing using the frequency domain binaural model

机译：使用频域双耳模型进行实时处理

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

There are many approaches to achieving high-performance speech enhancement. The modeling of the human auditory system is a good approach, since human beings can focus on target speech under concurrent speech conditions. One example of the binaural models is the time domain binaural model. However, this model has a high-calculation cost because the algorithm is based on auto-correlation, which is computationally intensive. Another example is the frequency domain binaural model proposed by Nakashima et al. [Nakashima H, Chisaki Y, Usagawa T, Ebata M. Frequency domain binaural model based on interaural phase and level differences. Acoust Sci Technol 2003;24(4):172-8]. Since the frequency domain binaural model uses the fast fourier transform, the calculation cost is much lower than that of the time domain binaural model. Therefore, it is not difficult to perform real-time processing using recent hardware such as digital signal processors and even laptop personal computers. However the quality of the segregated sound obtained using the frequency domain binaural model depends on system parameters such as frequency resolution and frame shift length for overlap adding in time domain. This paper introduces the construction of a prototype of a hearing assistant system based on the frequency domain binaural model. The detailed implementation techniques and parameter tuning are mentioned. The proposed system runs in real-time after parameter tuning. The directional attenuation levels, that is, the directivity patterns of the proposed system is measured. Finally, it is shown that the prototype can extract sounds coming from specific directions in real-time.

机译：有许多方法可以实现高性能语音增强。人类听觉系统的建模是一种很好的方法，因为人类可以在并发语音条件下专注于目标语音。双耳模型的一个例子是时域双耳模型。但是，该模型的计算成本很高，因为该算法基于自相关，而自相关是计算密集型的。另一个例子是中岛等人提出的频域双耳模型。 [Nakashima H，Chisaki Y，Usagawa T，EbataM。基于耳间相位和电平差异的频域双耳模型。 Acoust Sci Technol 2003; 24（4）：172-8]。由于频域双耳模型使用快速傅里叶变换，因此计算成本大大低于时域双耳模型。因此，使用诸如数字信号处理器甚至膝上型个人计算机之类的最新硬件执行实时处理并不困难。但是，使用频域双耳模型获得的分离声音的质量取决于系统参数，例如频率分辨率和时域重叠叠加的移码长度。本文介绍了基于频域双耳模型的助听器系统原型的构建。提到了详细的实现技术和参数调整。拟议的系统在参数调整后实时运行。测量方向衰减水平，即所提出系统的方向性图。最后，表明原型可以实时提取来自特定方向的声音。

著录项

来源
《Applied Acoustics》 |2007年第8期|p.923-938|共16页
作者
Yoshifumi Chisaki; Kotaro Matsuo; Katsumori Hagiwara; Hidetoshi Nakashima; Tsuyoshi Usagawa;
展开▼
作者单位

Department of Computer Science, Faculty of Engineering, Kumamoto University, 2-39-1 Kurokami, Kumamoto 860-8555, Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类声学;
关键词
frequency domain binaural model; real-time processing; linux;

机译：频域双耳模型;实时处理;Linux;
入库时间 2022-08-17 13:32:08

相似文献

外文文献
中文文献
专利

1. Real-time implementation of Frequency Domain Binaural Model and its basic characteristics [J] . Takashi NAKANISHI, Rika MATSUQt, Hidetoshi NAKASHIMA, 電子情報通信学会技術研究報告. 応用音響. Engineering Acoustics . 2003,第252期

机译：频域双耳模型的实时实现及其基本特征
2. Real-time implementation of Frequency Domain Binaural Model and its basic characteristics [J] . Takashi NAKANISHI, Rika MATSUQt, Hidetoshi NAKASHIMA, 電子情報通信学会技術研究報告. 応用音響. Engineering Acoustics . 2003,第252期

机译：频域双耳模型的实时实现及其基本特征
3. Effects of interaural delay, center frequency, and no more than "slight" hearing loss on precision of binaural processing: Empirical data and quantitative modeling [J] . Bernstein Leslie R., Trahiotis Constantine The Journal of the Acoustical Society of America . 2018,第1期

机译：内部延迟，中心频率，不超过“轻微”听力损失对双耳加工精度的影响：经验数据和定量造型
4. Binaural hearing assisting system with spatial selectivitybased on the frequency domain binaural model [C] . Y. Chisaki, R. Matsuo, H. Nakashima, International congress and exposition on noise control engineering;Inter-noise 2004 . 2004

机译：基于频域双耳模型的具有空间选择性的双耳助听系统
5. Efficient smart algorithms and architectures for real-time video transmission in pixel and frequency domains. [D] . Ismail, Yasser Ali. 2010

机译：用于像素和频域中实时视频传输的高效智能算法和体系结构。
6. Modeling Binaural Unmasking of Speech Using a Blind Binaural Processing Stage [O] . Christopher F. Hauth, Simon C. Berning, Birger Kollmeier, 2020

机译：使用盲双个加工阶段建模双耳揭露语音
7. Real-time optical properties and oxygenation imaging using custom parallel processing in the spatial frequency domain [O] . Enagnon Aguénounon, Foudil Dadouche, Wilfried Uhring, 2019

机译：使用空间频域中定制并行处理的实时光学性质和氧合成像

Real-time processing using the frequency domain binaural model

摘要

著录项

相似文献

相关主题

期刊订阅