首页> 外文会议>International conference on text, speech, and dialogue >Inserting Punctuation to ASR Output in a Real-Time Production Environment
【24h】

Inserting Punctuation to ASR Output in a Real-Time Production Environment

机译:在实时生产环境中将标点符号插入ASR输出

获取原文

摘要

The output of a speech recognition system is a continuous stream of words that has to be post-processed in various ways, out of which punctuation insertion is an essential step. Punctuated text is far more comprehensible to the reader, can be used for subtitling, and is necessary for further NLP processing, such as machine translation. In this article, we describe how state-of-the-art results in the field of punctuation restoration can be utilized in a production-ready business environment in the Czech language. A recurrent neural network based on long short-term memory is employed, making use of various features: textual based on pre-trained word embeddings, prosodic (mainly temporal), morphological, noise information, and speaker diarization. All the features except morphological tags were found to improve our baseline system. As we work in a real-time setup, it is not possible to employ information from the future of the word stream, yet we achieve significant improvements using LSTM. The usage of RNN also allows the model to learn longer dependencies than any n-gram-based language model can, which we find essential for the insertion of question marks. The deployment of an RNN-based model thus leads to a relative 22.6% decrease in punctuation errors and improvement in all metrics but one.
机译:语音识别系统的输出是一个连续的单词流,必须以各种方式对其进行后处理,其中插入标点符号是必不可少的步骤。标点符号的文本对于读者来说更容易理解,可以用于字幕,并且对于进一步的NLP处理(例如机器翻译)是必需的。在本文中,我们描述了如何在捷克语的生产就绪型业务环境中利用标点符号还原领域的最新成果。利用基于长短期记忆的递归神经网络,它利用了各种功能:基于预训练单词嵌入的文本,韵律(主要是时间的),形态,噪声信息和说话者区分。发现除了形态标记之外的所有功能都可以改善我们的基线系统。当我们在实时设置中工作时,不可能使用来自字流未来的信息,但是我们使用LSTM取得了显着改进。 RNN的使用还使该模型比任何基于n-gram的语言模型都可以学习更长的依赖关系,我们发现这对于插入问号至关重要。因此,基于RNN的模型的部署导致标点符号错误相对减少22.6%,除一个指标外,所有指标均得到改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号