首页> 外文会议>INTERSPEECH 2012 >A simple and efficient method to align very long speech signals to acoustically imperfect transcriptions
【24h】

A simple and efficient method to align very long speech signals to acoustically imperfect transcriptions

机译:一种简单富有高效的方法,使得非常长的语音信号对抗声学缺乏缺陷的转录

获取原文

摘要

In the framework of a contract with the Basque Parliament for subtitling the videos of bilingual plenary sessions, which basically consisted of aligning very long (around 3 hours long) audio tracks with syntactically correct but acoustically inaccurate text transcriptions (since all the disfluencies, mistakes, etc. were edited), a very simple and efficient procedure (avoiding the need for language nor lexical models, which was key because of the mix of languages) was developed as a first approach, before trying more complex schemes found in the literature. Since it worked pretty well and the output was quite satisfactory for the intended application, that simple approach was finally chosen. In this paper, we describe the approach in detail and apply it to a widely known annotated dataset (specifically, to the 1997 Hub4 task), to allow the comparison to a reference approach. Results demonstrate that our approach provides only slightly worse segmentations at a much lower computational cost and requiring much fewer resources. Moreover, if the resource to be segmented includes speech in two or more languages and speakers conmute between them at any time, applying a speech recognizer becomes unfeasible in practice, whereas our approach can be still applied with no additional cost.
机译:在合同中与巴斯克议会框架与语法正确的,但听觉不准确的文本记录字幕双语全体会议,基本上由对齐很长(约3小时之久)的音轨的视频(因为所有的不流利,失误,等被编辑),一个非常简单而有效的方法(避免语言也不词汇模型,因为语言的混合产品,其关键的需要)试图更复杂的方案之前,是作为第一种方法,在文献中找到。由于它的工作非常好和产量预期应用中规中矩,简单的方法,最终选择。在本文中,我们详细描述的方法,并把它应用到一个公知的带注释的数据集(即,对1997年的huB4任务),以使相比于参考的方法。结果证明我们的方法以低得多的计算成本,并需要大量的资源更少只提供略差分割。此外,如果要分割的资源包括语音在两个或更多的语言和音箱之间conmute在任何时候,应用语音识别成为在实践中是不可行的,而我们的方法可以在没有额外的成本仍然适用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号