首页> 美国卫生研究院文献>other >Robust speech perception: Recognize the familiar generalize to the similar and adapt to the novel
【2h】

Robust speech perception: Recognize the familiar generalize to the similar and adapt to the novel

机译:健壮的言语感知能力:识别熟悉的事物泛化成相似的事物并适应小说

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Successful speech perception requires that listeners map the acoustic signal to linguistic categories. These mappings are not only probabilistic, but change depending on the situation. For example, one talker’s /p/ might be physically indistinguishable from another talker’s /b/ (cf. lack of invariance). We characterize the computational problem posed by such a subjectively non-stationary world and propose that the speech perception system overcomes this challenge by (1) recognizing previously encountered situations, (2) generalizing to other situations based on previous similar experience, and (3) adapting to novel situations. We formalize this proposal in the ideal adapter framework: (1) to (3) can be understood as inference under uncertainty about the appropriate generative model for the current talker, thereby facilitating robust speech perception despite the lack of invariance. We focus on two critical aspects of the ideal adapter. First, in situations that clearly deviate from previous experience, listeners need to adapt. We develop a distributional (belief-updating) learning model of incremental adaptation. The model provides a good fit against known and novel phonetic adaptation data, including perceptual recalibration and selective adaptation. Second, robust speech recognition requires listeners learn to represent the structured component of cross-situation variability in the speech signal. We discuss how these two aspects of the ideal adapter provide a unifying explanation for adaptation, talker-specificity, and generalization across talkers and groups of talkers (e.g., accents and dialects). The ideal adapter provides a guiding framework for future investigations into speech perception and adaptation, and more broadly language comprehension.
机译:成功的语音感知要求听众将声音信号映射到语言类别。这些映射不仅是概率的,而且会根据情况而变化。例如,一个说话者的/ p /在身体上可能与另一个说话者的/ b /在物理上没有区别(参见缺乏不变性)。我们刻画了这种主观非平稳世界带来的计算问题,并提出语音感知系统克服了这一挑战,方法是:(1)识别先前遇到的情况,(2)根据以前的类似经验归纳为其他情况,以及(3)适应新情况。我们在理想的适配器框架中正式提出该建议:(1)到(3)可以理解为在不确定条件下对当前讲话者的适当生成模型进行推理,从而尽管缺乏不变性也有助于增强语音感知能力。我们专注于理想适配器的两个关键方面。首先,在明显偏离以往经验的情况下,听众需要适应。我们开发了渐进式适应的分布式(信念更新)学习模型。该模型可以很好地拟合已知和新颖的语音适应数据,包括感知性重新校准和选择性适应。第二,鲁棒的语音识别要求听众学会代表语音信号中跨情境变异性的结构化成分。我们将讨论理想适配器的这两个方面如何为适应性,讲话者特定性以及讲话者和讲话者群体(例如口音和方言)的泛化提供统一的解释。理想的适配器为将来的语音感知和适应以及更广泛的语言理解研究提供了指导框架。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号