Conference: Annual Conference of the International Speech Communication Association

Exploring Discriminative Speech Trajectory Structures


Abstract

The articulators of the human speech production mechanism can move only relatively sluggishly. As a result, the acoustic properties of speech sounds mostly change continuously and gradually over time. However, such continuity constraints are seldom exploited for discriminating between phones. To explore to what extent incorporating continuity information can improve phone discrimination, we investigated a multi-frame MFCC representation in combination with a supervised dimensionality reduction method that aims to find a low-dimensional representation that best separates the different phones. The speech continuity information is encoded by a second-order smoothness regularizer. Experimental results on TIMIT phone classification show that the regularizer helps to distinguish vowels better, but fails to improve the discrimination of consonants.
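The abstract does not state the exact form of the regularizer, so the following is only a minimal sketch of one common way a second-order smoothness penalty could be applied to a projection over a multi-frame MFCC window. It assumes the projection is stored as a (frames x coefficients) array and that the penalty is the sum of squared second differences along the time axis. The function names `second_difference_matrix` and `smoothness_penalty` are hypothetical and are not taken from the paper.

```python
import numpy as np


def second_difference_matrix(num_frames: int) -> np.ndarray:
    """Build the (num_frames - 2) x num_frames second-difference operator.

    Each row applies the stencil [1, -2, 1] to three consecutive frames.
    """
    if num_frames < 3:
        raise ValueError("need at least 3 frames for a second difference")
    d2 = np.zeros((num_frames - 2, num_frames))
    for t in range(num_frames - 2):
        d2[t, t:t + 3] = [1.0, -2.0, 1.0]
    return d2


def smoothness_penalty(weights: np.ndarray) -> float:
    """Sum of squared second differences of `weights` along axis 0 (time).

    `weights` is assumed to have shape (num_frames, num_coefficients),
    i.e. one row of projection weights per frame in the window.
    """
    d2 = second_difference_matrix(weights.shape[0])
    return float(np.sum((d2 @ weights) ** 2))


if __name__ == "__main__":
    frames = np.arange(5, dtype=float)
    # Weights that vary linearly over time have zero second difference.
    linear = np.outer(frames, np.ones(3))
    # Weights that vary quadratically over time are penalized.
    quadratic = np.outer(frames ** 2, np.ones(3))
    print(smoothness_penalty(linear), smoothness_penalty(quadratic))
```

Under these assumptions, a penalty of this kind would be added to the supervised dimensionality reduction objective with a tunable weight, so that projections that change abruptly from frame to frame are discouraged.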
