Journal: Neurocomputing

Time-frequency Representations In Speech Perception

Abstract

Present-day applications demand a comprehensive view of voice and speech perception in order to build more complex and competitive procedures capable of extracting as much knowledge as possible from sound-based human communication. Many tasks that extract knowledge from speech and voice may share signal-processing procedures that can be devised from a bio-inspired point of view. The present paper examines a hierarchy of sound-processing functions at the auditory and perceptual levels of the auditory neural pathways that can be translated into bio-inspired speech-processing techniques, and analyzes their fundamental characteristics in relation to current trends in cognitive audio processing. The pathways linking the peripheral auditory system (cochlear complex) with the brain cortex are briefly examined, with special attention to neuronal structures that show specific capabilities for formant analysis and for building up a semantic hierarchy from the time-frequency structure of speech. The aim is to explore how far these structures can convey semantics to speech processing and understanding, starting from the minimal acoustic cues with elementary meaning, or "sematoms". Replicating known biological functionality by algorithmic methods through bio-inspiration is a secondary aim of the research. Examples drawn from speech-processing tasks in the domain of acoustic phonetics are presented. These may be applicable to speech recognition, speaker characterization and biometrics, emotion detection, and related tasks.
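As a rough illustration of what a time-frequency representation looks like in practice, the sketch below computes a short-time Fourier magnitude spectrogram with NumPy. It is not the method of the paper, which the abstract does not specify. The function name `stft_magnitude`, the frame and hop sizes, and the two-tone test signal (a crude stand-in for a vowel with two formant-like peaks) are all assumptions made for this example.

```python
import numpy as np

def stft_magnitude(signal, sample_rate, frame_len=512, hop=128):
    """Hann-windowed short-time Fourier transform magnitude.

    Returns (times, freqs, magnitude), where magnitude has shape
    (n_frames, frame_len // 2 + 1). Illustrative helper, not from the paper.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the signal into overlapping frames and apply the window to each.
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sample_rate)
    # Time stamp of each frame, taken at the frame centre.
    times = (np.arange(n_frames) * hop + frame_len / 2) / sample_rate
    return times, freqs, magnitude

# Synthetic test signal: two steady tones at hypothetical formant-like frequencies.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate  # one second of samples
signal = np.sin(2 * np.pi * 700 * t) + 0.5 * np.sin(2 * np.pi * 1200 * t)

times, freqs, mag = stft_magnitude(signal, sample_rate)
# Frequency bin with the largest average magnitude across all frames.
peak_hz = freqs[np.argmax(mag.mean(axis=0))]
print(f"{mag.shape[0]} frames x {mag.shape[1]} bins; strongest component near {peak_hz:.0f} Hz")
```

Each row of `mag` is the spectrum of one short frame, so ridges that persist across rows correspond to stable spectral peaks. Real speech would need recorded audio and a proper formant tracker; the steady tones here only make the peak easy to check.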