首页> 外文会议>IEEE International Conference on Cognitive Infocommunications >A prosody inspired RNN approach for punctuation of machine produced speech transcripts to improve human readability
【24h】

A prosody inspired RNN approach for punctuation of machine produced speech transcripts to improve human readability

机译:受韵律启发的RNN方法标点机器产生的语音成绩单,以提高人类可读性

获取原文

摘要

Speech communication human-machine interfaces exploit automatic speech recognition to implement speech-to-text conversion. Unfortunately, in the past, not much effort has been devoted to add punctuation marks to the recognized word chain after speech recognition. This affects human readability and makes interpretation hard. This paper presents an effort to restore punctuation marks by keeping low the latency resulting from this post-processing step. The approach exploits the prosodic structure and proposes a sequential modelling paradigm based on recurrent neural networks. Results show satisfying punctuation restoration abilities, especially taking into account that sentence boundaries are reliably detected. Even if the predicted punctuation sequence is not error free w.r.t. writing standards, human perception is expected to “repair” these errors more easily compared to the case when no punctuation is given at all and the reader is left in confusion regarding the basic segmentation of the word chain.
机译:语音通信人机界面利用自动语音识别来实现语音到文本的转换。不幸的是,过去,在语音识别之后,没有花费太多的努力来将标点符号添加到已识别的单词链上。这会影响人类的可读性并使解释变得困难。本文提出了一种通过保持较低的后处理步骤延迟来恢复标点符号的方法。该方法利用了韵律结构,并提出了基于递归神经网络的顺序建模范例。结果显示令人满意的标点恢复能力,特别是考虑到可靠地检测到句子边界的情况。即使预测的标点序列不是无错误的与根本没有标点符号并且读者对单词链的基本分段感到困惑的情况相比,在编写标准时,人们的感知期望更容易“修复”这些错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号