首页> 外文会议>International Conference on Text, Speech and Dialogue(TSD 2006); 20060911-15; Brno(CZ) >Using Prosody for Automatic Sentence Segmentation of Multi-party Meetings
【24h】

Using Prosody for Automatic Sentence Segmentation of Multi-party Meetings

机译:使用韵律进行多方会议的句子自动分段

获取原文
获取原文并翻译 | 示例

摘要

We explore the use of prosodic features beyond pauses, including duration, pitch, and energy features, for automatic sentence segmentation of ICSI meeting data. We examine two different approaches to boundary classification: score-level combination of independent language and prosodic models using HMMs, and feature-level combination of models using a boosting-based method (BoosTexter). We report classification results for reference word transcripts as well as for transcripts from a state-of-the-art automatic speech recognizer (ASR). We also compare results using the lexical model plus a pause-only prosody model, versus results using additional prosodic features. Results show that (1) information from pauses is important, including pause duration both at the boundary and at the previous and following word boundaries; (2) adding duration, pitch, and energy features yields significant improvement over pause alone; (3) the integrated boosting-based model performs better than the HMM for ASR conditions; (4) training the boosting-based model on recognized words yields further improvement.
机译:我们探究了使用韵律特征(包括持续时间,音调和能量特征)来进行暂停,以实现ICSI会议数据的自动句子分割。我们研究了两种不同的边界分类方法:使用HMM的独立语言和韵律模型的得分级组合,以及使用基于Boosting的方法(BoosTexter)的模型的特征级组合。我们报告参考词笔录以及来自最先进的自动语音识别器(ASR)的笔录的分类结果。我们还将使用词汇模型和仅暂停韵律模型的结果与使用其他韵律特征的结果进行比较。结果表明:(1)来自暂停的信息很重要,包括边界以及前后单词边界的暂停持续时间; (2)增加持续时间,音调和能量特征比单独的暂停产生了明显的改善; (3)在ASR条件下,集成的基于Boosting的模型的性能优于HMM; (4)在识别的单词上训练基于Boosting的模型产生了进一步的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号