Journal: Speech Communication

A combined punctuation generation and speech recognition system and its performance enhancement using prosody

Abstract

A punctuation generation system which combines prosodic information with acoustic and language model information is presented. Experiments have been conducted for both the reference text transcriptions and speech recogniser outputs. For the reference transcription, prosodic information of acoustic data is shown to be more useful than language model information. Several straightforward modifications of a conventional speech recogniser allow the system to produce punctuation and speech recognition hypotheses simultaneously. The multiple hypotheses produced by the automatic speech recogniser are then re-scored using prosodic information. When the prosodic information is incorporated, the F-measure (defined as the harmonic mean of recall and precision) can be improved. This speech recognition system including punctuation gives a small reduction in word error rate on the 1-best speech recognition output including punctuation. An alternative approach for generating punctuation from the unpunctuated 1-best speech recognition output is also proposed. The results from these two alternative schemes are compared.
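
The two quantitative ideas in the abstract, the F-measure and the re-scoring of multiple hypotheses, can be illustrated with a minimal Python sketch. The abstract does not specify how punctuation marks are aligned or how prosodic scores are combined, so everything below is an assumption made for illustration: punctuation is represented as (word index, mark) pairs, and re-scoring is a simple weighted sum of a recogniser log score and a prosodic log score. The names `f_measure`, `punctuation_scores`, `rescore` and `prosody_weight` are hypothetical and do not come from the paper.

```python
def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def punctuation_scores(reference, hypothesis):
    """Return (precision, recall, F-measure) for predicted punctuation.

    Assumption: both arguments are collections of (word_index, mark)
    pairs already aligned to the same word sequence.
    """
    ref, hyp = set(reference), set(hypothesis)
    correct = len(ref & hyp)
    precision = correct / len(hyp) if hyp else 0.0
    recall = correct / len(ref) if ref else 0.0
    return precision, recall, f_measure(precision, recall)


def rescore(hypotheses, prosody_weight=1.0):
    """Pick the hypothesis with the highest combined score.

    Assumption: each hypothesis is a tuple
    (text, recogniser_log_score, prosody_log_score), and the combined
    score is a weighted sum. This is a generic scheme, not necessarily
    the one used in the paper.
    """
    return max(hypotheses, key=lambda h: h[1] + prosody_weight * h[2])


if __name__ == "__main__":
    reference = [(3, ","), (7, "."), (12, "?")]
    hypothesis = [(3, ","), (7, "."), (9, ",")]
    print(punctuation_scores(reference, hypothesis))

    n_best = [
        ("hello world.", -10.0, -3.0),
        ("hello, world.", -10.5, -1.0),
    ]
    print(rescore(n_best)[0])
```

With `prosody_weight=0.0` the function falls back to the recogniser's own ranking, which makes it easy to see how much the prosodic term changes the selected hypothesis.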
