Time-frequency Representations In Speech Perception

Pedro Gomez-Vilda; Jose M. Ferrandez-Vicente; Victoria Rodellar-Biarge; Roberto Fernandez-Baillo

Abstract

Current applications demand a comprehensive view of voice and speech perception in order to build more complex and competitive procedures capable of extracting as much knowledge as possible from sound-based human communication. Many tasks that extract knowledge from speech and voice may share signal-processing procedures that can be designed from a bio-inspired point of view. The present paper examines a hierarchy of sound-processing functions at the auditory and perceptual levels of the auditory neural pathways that can be translated into bio-inspired speech-processing techniques, and analyzes their fundamental characteristics in relation to current trends in cognitive audio processing. The pathways linking the peripheral auditory system (cochlear complex) with the brain cortex are briefly examined, with special attention to neuronal structures that show specific capabilities for formant analysis and for building up a semantic hierarchy from the time-frequency structure of speech. The goal is to explore their capability of conveying semantics to speech processing and understanding, starting from the minimal acoustic cues with elementary meaning, or "sematoms". A secondary aim of the research is to replicate known biological functionality by algorithmic methods through bio-inspiration. Examples drawn from speech-processing tasks in the domain of acoustic-phonetics are presented. These may find application in speech recognition, speaker characterization and biometry, emotion detection, and related tasks.

Bibliographic details

Source: Neurocomputing, 2009, Issue 6, pp. 820-830 (11 pages)
Authors: Pedro Gomez-Vilda; Jose M. Ferrandez-Vicente; Victoria Rodellar-Biarge; Roberto Fernandez-Baillo
Affiliation: Facultad de Informatica, Universidad Politecnica de Madrid, Campus de Montegancedo, s, 28660 Boadilla del Monte, Madrid, Spain
Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language of text: English
Chinese Library Classification: Computing technology, computer technology
Keywords: bio-inspired speech processing; speech perception; acoustic-phonetics; phonetic boundaries and classes; minimal semantic units
Record added: 2022-08-18 02:08:34
