【2h】

Research in speech communication.

机译:语音交流研究。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Advances in digital speech processing are now supporting application and deployment of a variety of speech technologies for human/machine communication. In fact, new businesses are rapidly forming about these technologies. But these capabilities are of little use unless society can afford them. Happily, explosive advances in microelectronics over the past two decades have assured affordable access to this sophistication as well as to the underlying computing technology. The research challenges in speech processing remain in the traditionally identified areas of recognition, synthesis, and coding. These three areas have typically been addressed individually, often with significant isolation among the efforts. But they are all facets of the same fundamental issue--how to represent and quantify the information in the speech signal. This implies deeper understanding of the physics of speech production, the constraints that the conventions of language impose, and the mechanism for information processing in the auditory system. In ongoing research, therefore, we seek more accurate models of speech generation, better computational formulations of language, and realistic perceptual guides for speech processing--along with ways to coalesce the fundamental issues of recognition, synthesis, and coding. Successful solution will yield the long-sought dictation machine, high-quality synthesis from text, and the ultimate in low bit-rate transmission of speech. It will also open the door to language-translating telephony, where the synthetic foreign translation can be in the voice of the originating talker.
机译:现在,数字语音处理技术的进步正在支持各种语音技术在人机交互中的应用和部署。实际上,有关这些技术的新业务正在迅速形成。但是,除非社会能够承受,否则这些功能几乎没有用。令人高兴的是,过去二十年来微电子学的爆炸性发展确保了人们可以负担得起这种复杂性以及基础计算技术的费用。语音处理方面的研究挑战仍然存在于传统确定的识别,合成和编码领域。这三个领域通常是单独解决的,通常在工作之间存在重大隔离。但是它们都是同一基本问题的方面,即如何表示和量化语音信号中的信息。这意味着需要对语音产生的物理学,语言惯例所施加的约束以及听觉系统中信息处理的机制有更深入的了解。因此,在正在进行的研究中,我们寻求更准确的语音生成模型,更好的语言计算公式以及用于语音处理的逼真的感知指导,以及结合识别,合成和编码等基本问题的方法。成功的解决方案将产生人们期待已久的听写机,文本的高质量合成以及最终的低比特率语音传输。它还将为翻译语言的电话敞开大门,合成的外国翻译可以在发话者的声音中表达。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号