首页> 外文会议>INTERSPEECH 2012 >Sentence Detection Using Multiple Annotations
【24h】

Sentence Detection Using Multiple Annotations

机译:使用多个注释检测句子检测

获取原文

摘要

In this paper, we develop a sentence boundary detection system which incorporates a prosodic model, word and preterminal-level language models, and a global sentence-length model. An important aspect of this research was the investigation of crowdsourced punctuation annotations as a source of multiple references for evaluation purposes. In order to evaluate the system we propose a BLEU-like metric which compares a hypothesis to multiple references. Experiments on both transcription and ASR output show that the global sentence length model can improve the performance by 7.2% on reference transcripts and 3.8% on ASR output.
机译:在本文中,我们开发了一个句子边界检测系统,该系统包含韵律模型,单词和前级语言模型和全局句子长度模型。该研究的一个重要方面是调查众包标点符号注释作为评估目的的多个参考资料来源。为了评估系统,我们提出了一种类似的Bleu形度量,该度量将假设与多个引用进行比较。转录和ASR输出的实验表明,全局句子长度模型可以在参考转录程序上提高7.2%的性能,ASR输出上的3.8%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号