European Signal Processing Conference (EUSIPCO 2004), vol. 1; September 6-10, 2004; Vienna (AT)

AUTOMATIC SEGMENTATION AND LABELING OF CONTINUOUS SPEECH WITHOUT BOOTSTRAPPING


Abstract

In this paper, a novel approach is proposed for automatically segmenting and transcribing continuous speech signals without the use of manually segmented and labeled speech corpora. The continuous speech signal is first segmented into syllable-like units by treating the short-term energy as the magnitude spectrum of some arbitrary signal. Similar syllable segments are then grouped together using an unsupervised, incremental clustering technique. A separate model is generated for each cluster of syllable segments. At this stage, labels are assigned manually to each group of syllable segments. The syllable models of these clusters are then used to transcribe/recognize the continuous speech signals of closed-set speakers as well as open-set speakers. As a syllable recognizer, our initial results on Indian television news bulletins in the languages Tamil and Telugu show that the performance is 43.3% and 32.9%, respectively.
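
The abstract gives no implementation details, so the following is only a rough illustration of the general idea of energy-based segmentation into syllable-like units. It is a simplified stand-in, not the paper's method: instead of treating the short-term energy as a magnitude spectrum, it smooths the energy contour with a moving average and marks low-energy local minima as candidate boundaries. All function names, the frame length, the hop size, and the threshold are assumptions made for this sketch.

```python
import math


def short_term_energy(samples, frame_len, hop):
    """Return the sum of squared samples for each (overlapping) frame."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame))
    return energies


def smooth(values, width):
    """Centered moving average; the window shrinks at the edges."""
    half = width // 2
    out = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        out.append(sum(values[lo:hi]) / (hi - lo))
    return out


def candidate_boundaries(energies, width=5, rel_threshold=0.5):
    """Frame indices where the smoothed energy has a low local minimum.

    A minimum is kept only if it lies below rel_threshold * max(smoothed),
    which suppresses tiny dips inside a high-energy region (assumed
    heuristic, not taken from the paper).
    """
    s = smooth(energies, width)
    if not s:
        return []
    limit = rel_threshold * max(s)
    return [
        i
        for i in range(1, len(s) - 1)
        if s[i] < limit and s[i] < s[i - 1] and s[i] <= s[i + 1]
    ]


if __name__ == "__main__":
    # Synthetic input: two 200 Hz tone bursts separated by silence (8 kHz).
    rate = 8000
    tone = [math.sin(2 * math.pi * 200 * n / rate) for n in range(1600)]
    signal = tone + [0.0] * 800 + tone
    energies = short_term_energy(signal, frame_len=160, hop=80)
    print(candidate_boundaries(energies))
```

On the synthetic input above, the only candidate boundary falls inside the silent gap between the two bursts. Real speech would need tuned frame sizes and thresholds, and the resulting segments would still have to be clustered and labeled as the abstract describes.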
