首页> 外文会议>IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics >COMPUTATIONAL AUDITORY SCENE ANALYSIS EXPLOITING SPEECH-RECOGNITION KNOWLEDGE
【24h】

COMPUTATIONAL AUDITORY SCENE ANALYSIS EXPLOITING SPEECH-RECOGNITION KNOWLEDGE

机译:剥削语音识别知识的计算听觉场景分析

获取原文

摘要

The field of computational auditory scene analysis (CASA) strives to build computer models of the human ability to interpret sound mixtures as the combination of distinct sources. A major obstacle to this enterprise is defining and incorporating the kind of high level knowledge of real-world signal structure exploited by listeners. Speech recognition, while typically ignoring the problem of nonspeech inclusions, has been very successful at deriving powerful statistical models of speech structure from training data. In this paper, we describe a scene analysis system that includes both speech and nonspeech components, addressing the problem of working backwards from speech recognizer output to estimate the speech component of a mixture. Ultimately, such hybrid approaches will require more radical adaptation of current speech recognition approaches.
机译:计算听觉场景分析(CASA)领域努力构建人类能力的计算机模型,以将声音混合物解释为独特源的组合。本企业的主要障碍是定义和纳入听众利用的现实世界信号结构的高级知识。语音识别,虽然通常忽略非静音夹杂物的问题,但在从训练数据中导出了语音结构的强大统计模型非常成功。在本文中,我们描述了一种场景分析系统,包括语音和非诊断组件,解决从语音识别器输出后向后工作以估计混合的语音组件。最终,这种混合方法需要更自然的语音识别方法的自由基调整。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号