Conference: International Conference on Text, Speech and Dialogue

A Multi-criteria Text Selection Approach for Building a Speech Corpus

Abstract

A speech corpus is a primary requirement for several speech tasks. Building a speech corpus is a lengthy and expensive process. It typically involves collecting a large set of text utterances and then selectively distributing them among a set of speakers; the set of utterances assigned to each speaker is called a speaker sheet. The speakers read their speaker sheets aloud to produce the speech corpus. Depending on the task at hand, the speech corpus needs to satisfy certain criteria. For example, a phonetically balanced speech corpus is essential for building an automatic speech recognition (ASR) engine, while a text-dependent speaker recognition engine needs several spoken repetitions of the same text by several speakers. In this paper, we formulate a method that enables the creation of speaker sheets from a predetermined set of text utterances such that the speech corpus satisfies the desired requirements.
