【24h】

Human Audition as Statistical Recognition System for Audio Signals

机译:人类听觉作为音频信号的统计识别系统

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper, the results of the study concerning modeling the processes of human perception and recognition of audio signals are discussed. Based on the analysis of reliable psychophysical data, a model (conception) is synthesized within the framework of the Neyman-Pearson approach, which is well-known in the theory of statistical decisions. This model allows a universal view of numerous facts and dependences that have been established by now in relation to the auditory system. A sequential recognition procedure was developed around solid data on the structure and functions of the auditory system. Optimization of the recognition procedure revealed that the perceptually meaningful feature set representing an audio signal consists of sampled values of component envelopes taken in certain time instants. These instants depend on the pattern for which the similarity hypothesis is tested. In the general case, the time instants are not equidistant, contrary to the existing speech recognition techniques. We show how this peculiarity is related to the well-known psychophysical Weber-Fechner law. Theoretical study of the recognition procedure is supplemented by the discussion of possible realization issues. It is shown that the realization of the procedure, especially recursive realization, results in fast and efficient numerical algorithms. These algorithms can be naturally realized on structures similar to neural networks. The relation is considered between recursive realization and well-established recognition techniques, such as LCP (PLP) and MFCC.
机译:在本文中,讨论了关于对人类感知和识别音频信号的过程进行建模的研究结果。基于对可靠的心理物理数据的分析,在Neyman-Pearson方法的框架内综合了一个模型(构想),这在统计决策理论中是众所周知的。该模型允许对目前与听觉系统有关的许多事实和依存关系具有统一的看法。围绕听觉系统的结构和功能的可靠数据开发了顺序识别程序。识别程序的优化表明,代表音频信号的在感知上有意义的特征集由在特定时刻获取的分量包络的采样值组成。这些瞬间取决于测试相似性假设的模式。在一般情况下,与现有的语音识别技术相反,时刻不是等距的。我们展示了这种特殊性与众所周知的心理物理学韦伯-费希纳定律之间的关系。对识别过程的理论研究得到了可能实现问题的讨论的补充。结果表明,该过程的实现,尤其是递归实现,导致了快速有效的数值算法。这些算法可以在类似于神经网络的结构上自然实现。在递归实现和公认的识别技术(例如LCP(PLP)和MFCC)之间考虑了这种关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号