首页> 外文会议>Annual conference of the International Speech Communication Association >A simple and efficient method to align very long speech signals to acoustically imperfect transcriptions
【24h】

A simple and efficient method to align very long speech signals to acoustically imperfect transcriptions

机译:一种简单有效的方法,可将很长的语音信号对准听觉上不完美的转录

获取原文

摘要

In the framework of a contract with the Basque Parliament for subtitling the videos of bilingual plenary sessions, which basically consisted of aligning very long (around 3 hours long) audio tracks with syntactically correct but acoustically inaccurate text transcriptions (since all the disfluencies, mistakes, etc. were edited), a very simple and efficient procedure (avoiding the need for language nor lexical models, which was key because of the mix of languages) was developed as a first approach, before trying more complex schemes found in the literature. Since it worked pretty well and the output was quite satisfactory for the intended application, that simple approach was finally chosen. In this paper, we describe the approach in detail and apply it to a widely known annotated dataset (specifically, to the 1997 Hub4 task), to allow the comparison to a reference approach. Results demonstrate that our approach provides only slightly worse segmentations at a much lower computational cost and requiring much fewer resources. Moreover, if the resource to be segmented includes speech in two or more languages and speakers conmute between them at any time, applying a speech recognizer becomes unfeasible in practice, whereas our approach can be still applied with no additional cost.
机译:在与巴斯克议会签订的合同中,对双语全体会议的视频进行字幕处理的过程中,基本上包括将很长(约3小时长)的音轨与句法正确但听觉上不准确的文本转录对齐(因为所有的疏忽,错误,首先,开发了一种非常简单有效的程序(避免使用语言或词汇模型,这是由于语言混合而成为关键)的方法,然后才尝试使用文献中发现的更复杂的方案。由于它工作得很好并且输出对于预期的应用程序来说是令人满意的,因此最终选择了这种简单的方法。在本文中,我们将详细描述该方法,并将其应用于广泛已知的带注释的数据集(特别是1997 1997 Hub4任务),以便与参考方法进行比较。结果表明,我们的方法以更低的计算成本和更少的资源提供了更差的分割。此外,如果要分割的资源包括两种或更多种语言的语音,并且说话者随时在它们之间相互干扰,那么在实践中应用语音识别器将变得不可行,而我们的方法仍然可以无额外成本地应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号