首页> 外文会议> >Speaker- and language-independent speech recognition in mobile communication systems
【24h】

Speaker- and language-independent speech recognition in mobile communication systems

机译:移动通信系统中与说话者和语言无关的语音识别

获取原文

摘要

We investigate the technical challenges that are faced when making a transition from the speaker-dependent to speaker-independent speech recognition technology in mobile communication devices. Due to globalization as well as the international nature of the markets and the future applications, speaker independence implies the development and use of language-independent automatic speech recognition (ASR) to avoid logistic difficulties. We propose an architecture for embedded multilingual speech recognition systems. Multilingual acoustic modeling, automatic language identification, and on-line pronunciation modeling are the key features which enable the creation of truly language- and speaker-independent ASR applications with dynamic vocabularies and sparse implementation resources. Our experimental results confirm the viability of the proposed architecture. While the use of multilingual acoustic models degrades the recognition rates only marginally, a recognition accuracy decrease of approximately 4% is observed due to sub-optimal on-line text-to-phoneme mapping and automatic language identification. This performance loss can nevertheless be compensated by applying acoustic model adaptation techniques.
机译:我们研究了在移动通信设备中从依赖于说话者的语音识别技术向不依赖于说话者的语音识别技术过渡时面临的技术挑战。由于全球化以及市场的国际性质和未来的应用,说话人的独立性意味着开发和使用独立于语言的自动语音识别(ASR)可以避免后勤困难。我们提出了一种嵌入式多语言语音识别系统的体系结构。多语言声学建模,自动语言识别和在线发音建模是关键特性,它们可以创建具有动态词汇量和稀疏实现资源的真正独立于语言和说话者的ASR应用程序。我们的实验结果证实了所提出体系结构的可行性。尽管使用多语言声学模型仅会略微降低识别率,但由于次优的在线文本到音素映射和自动语言识别,识别精度下降了约4%。但是,可以通过应用声学模型自适应技术来补偿这种性能损失。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号