首页> 美国卫生研究院文献>other >Robust speech perception: Recognize the familiar generalize to the similar and adapt to the novel

【2h】

Robust speech perception: Recognize the familiar generalize to the similar and adapt to the novel

机译：健壮的言语感知能力：识别熟悉的事物泛化成相似的事物并适应小说

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Successful speech perception requires that listeners map the acoustic signal to linguistic categories. These mappings are not only probabilistic, but change depending on the situation. For example, one talker’s /p/ might be physically indistinguishable from another talker’s /b/ (cf. lack of invariance). We characterize the computational problem posed by such a subjectively non-stationary world and propose that the speech perception system overcomes this challenge by (1) recognizing previously encountered situations, (2) generalizing to other situations based on previous similar experience, and (3) adapting to novel situations. We formalize this proposal in the ideal adapter framework: (1) to (3) can be understood as inference under uncertainty about the appropriate generative model for the current talker, thereby facilitating robust speech perception despite the lack of invariance. We focus on two critical aspects of the ideal adapter. First, in situations that clearly deviate from previous experience, listeners need to adapt. We develop a distributional (belief-updating) learning model of incremental adaptation. The model provides a good fit against known and novel phonetic adaptation data, including perceptual recalibration and selective adaptation. Second, robust speech recognition requires listeners learn to represent the structured component of cross-situation variability in the speech signal. We discuss how these two aspects of the ideal adapter provide a unifying explanation for adaptation, talker-specificity, and generalization across talkers and groups of talkers (e.g., accents and dialects). The ideal adapter provides a guiding framework for future investigations into speech perception and adaptation, and more broadly language comprehension.

机译：成功的语音感知要求听众将声音信号映射到语言类别。这些映射不仅是概率的，而且会根据情况而变化。例如，一个说话者的/ p /在身体上可能与另一个说话者的/ b /在物理上没有区别（参见缺乏不变性）。我们刻画了这种主观非平稳世界带来的计算问题，并提出语音感知系统克服了这一挑战，方法是：（1）识别先前遇到的情况，（2）根据以前的类似经验归纳为其他情况，以及（3）适应新情况。我们在理想的适配器框架中正式提出该建议：（1）到（3）可以理解为在不确定条件下对当前讲话者的适当生成模型进行推理，从而尽管缺乏不变性也有助于增强语音感知能力。我们专注于理想适配器的两个关键方面。首先，在明显偏离以往经验的情况下，听众需要适应。我们开发了渐进式适应的分布式（信念更新）学习模型。该模型可以很好地拟合已知和新颖的语音适应数据，包括感知性重新校准和选择性适应。第二，鲁棒的语音识别要求听众学会代表语音信号中跨情境变异性的结构化成分。我们将讨论理想适配器的这两个方面如何为适应性，讲话者特定性以及讲话者和讲话者群体（例如口音和方言）的泛化提供统一的解释。理想的适配器为将来的语音感知和适应以及更广泛的语言理解研究提供了指导框架。

著录项

期刊名称 other
作者
Dave F. Kleinschmidt; T. Florian Jaeger;
展开▼
作者单位

展开▼
年(卷),期 -1(122),2
年度 -1
页码 148–203
总页数 115
原文格式 PDF
正文语种
中图分类
关键词
speech perception generalization adaptation statistical learning hierarchical structure lack of invariance non-stationarity;

机译：言语感知;泛化;适应;统计学习;层次结构;不变性缺乏;不稳定;

相似文献

外文文献
中文文献
专利

1. Recognizing familiar objects by hand and foot: Haptic shape perception generalizes to inputs from unusual locations and untrained body parts [J] . Rebecca Lawson Attention, perception & psychophysics . 2014,第2期

机译：通过手和脚识别熟悉的物体：触觉形状感知可概括为来自异常位置和未经训练的身体部位的输入
2. Recognizing articulatory gestures from speech for robust speech recognition [J] . Mitra V., Nam H., Espy-Wilson C., The Journal of the Acoustical Society of America . 2012,第3aPta1期

机译：识别语音中的发音手势以实现可靠的语音识别
3. Recognizing speech in a novel accent: The motor theory of speech perception reframed [J] . Moulin-Frier C., Arbib M.A. Biological Cybernetics: Communication and Control in Organisms and Automata: = Nachrichtenubertragung, Nachrichtenverarbeitung, Steuerung und Regelung in Organismen und in Automaten . 2013,第4期

机译：以新颖的口音识别语音：语音感知的运动理论重构
4. Comparison of Effects of Acoustic and Language Knowledge on Spontaneous Speech Perception/Recognition between Human and Automatic Speech Recognizer [C] . Norihide Kitaoka, Masahisa Shingu, Seiichi Nakagawa, European Conference on Speech Communication and Technology . 2003

机译：声学和语言知识对人与自动语音识别器自发言语感知/识别的影响
5. Perception of familiar melodies and tonal speech by Taiwanese pediatric cochlear implant recipients. [D] . Hsiao, Fei-Lin. 2006

机译：台湾小儿人工耳蜗植入者的熟悉旋律和音调语音。
6. Effects of cross-language voice training on speech perception: Whose familiar voices are more intelligible? [O] . Susannah V. Levi, Stephen J. Winters, David B. Pisoni -1

机译：跨语言语音训练对语音感知的影响：谁的熟悉语音更易懂？
7. Speech recognizer-based microphone array processing for robust hands-free speech recognition [O] . Michael L. Seltzer, Bhiksha Raj, Richard M. Stern 2002

机译：基于语音识别器的麦克风阵列处理，实现强大的免提语音识别
8. Recognizing Articulatory Gestures from Speech for Robust Speech Recognition. [R] . C. Espy-Wilson E. Saltzman H. Nam L. Goldstein V. Mitra 2012

机译：从语音识别衔接手势以获得强大的语音识别能力。

Robust speech perception: Recognize the familiar generalize to the similar and adapt to the novel

摘要

著录项

相似文献

相关主题

期刊订阅