首页> 外文学位 >Perception-based multi-resolution auditory processing of acoustic signals.
【24h】

Perception-based multi-resolution auditory processing of acoustic signals.

机译:基于感知的声音信号多分辨率听觉处理。

获取原文
获取原文并翻译 | 示例

摘要

A multi-resolution auditory model is proposed to simulate the spectrotemporal processing of the primary auditory cortex. Inspired by recent physiological findings, the model produces a multi-dimensional representation of cortical activity. Though several nonlinear operations are involved, the inversion of the representation is obtained by applying convex projection technique. A series of psychoacoustical experiments were conducted to estimate the appropriate units for the axes of this auditory model. The "perceptual distance" measure, which was derived from the subjective results, outperforms the independent channel model in threshold prediction tasks. Additionally, a simplified vocal tract model was employed to explore the articulatory equivalence to the cortical axes. This study suggests that both local and global changes in the geometry of the vocal tract result in meaningful changes in the cortical response. The perceptual distance measure, when applied to vowel recognition and timbre quantification, yields better performance than conventional signal processing techniques. Given enough computing power, this perception-based auditory model can be used in many applications like speech recognition, audio coding, and sound identification.
机译:提出了一种多分辨率听觉模型来模拟主要听觉皮层的光谱时间处理。受近期生理学发现的启发,该模型产生了皮质活动的多维表示。尽管涉及几个非线性运算,但是通过应用凸投影技术可以得到表示的反演。进行了一系列的心理声学实验,以估计该听觉模型的轴的适当单位。从主观结果得出的“感知距离”量度在阈值预测任务中优于独立通道模型。另外,采用简化的声道模型来探究皮层轴的关节等效性。这项研究表明,声道几何形状的局部和整体变化都会导致皮质反应发生有意义的变化。当应用于元音识别和音色量化时,感知距离测量比常规信号处理技术产生更好的性能。有了足够的计算能力,这种基于感知的听觉模型就可以用于许多应用中,例如语音识别,音频编码和声音识别。

著录项

  • 作者

    Ru, Po-Wen.;

  • 作者单位

    University of Maryland, College Park.;

  • 授予单位 University of Maryland, College Park.;
  • 学科 Engineering Electronics and Electrical.;Psychology Psychometrics.;Physics Acoustics.
  • 学位 Ph.D.
  • 年度 2000
  • 页码 215 p.
  • 总页数 215
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;声学;心理学研究方法;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号