首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >SYLL-O-MATIC: AN ADAPTIVE TIME-FREQUENCY REPRESENTATION FOR THE AUTOMATIC SEGMENTATION OF SPEECH INTO SYLLABLES
【24h】

SYLL-O-MATIC: AN ADAPTIVE TIME-FREQUENCY REPRESENTATION FOR THE AUTOMATIC SEGMENTATION OF SPEECH INTO SYLLABLES

机译:syll-o-matic:自动分段为语音分段为音节的自适应时频表示

获取原文

摘要

This paper introduces novel paradigms for the segmentation of speech into syllables. The main idea of the proposed method is based on the use of a time-frequency representation of the speech signal, and the fusion of intensity and voicing measures through various frequency regions for the automatic selection of pertinent information for the segmentation. The time-frequency representation is used to exploit the speech characteristics depending on the frequency region. In this representation, intensity profiles are measured to provide information into various frequency regions, and voicing profiles are measured to determine the frequency regions that are pertinent for the segmentation. The proposed method outperforms conventional methods for the detection of syllable landmark and boundaries on the TIMIT database of American-English, and provides a promising paradigm for the segmentation of speech into syllables.
机译:本文介绍了对音节分割的新颖范式。所提出的方法的主要思想基于使用语音信号的时频表示,以及通过各种频率区域的强度和发声测量的融合,用于自动选择分割的相关信息。时频表示用于根据频率区域利用语音特性。在该表示中,测量强度分布以向各种频率区域提供信息,并且测量发声轮廓以确定对分割相关的频率区域。所提出的方法优于检测美国英语的Timit数据库上的音节地标和边界的传统方法,并为演讲分割为音节提供了有希望的范式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号