Conference: INTERSPEECH 2012

Exploring Discriminative Speech Trajectory Structures

Abstract

The articulators of the human speech production system move relatively slowly. As a result, the acoustic properties of speech sounds mostly change continuously and gradually over time. However, such continuity constraints are seldom exploited for discriminating between phones. To explore how far continuity information can improve phone discrimination, we investigated a multi-frame MFCC representation combined with a supervised dimensionality reduction method that seeks the low-dimensional representation that best separates the phones. The speech continuity information is encoded by a second-order smoothness regularizer. Experimental results on TIMIT phone classification show that the regularizer helps to distinguish vowels better, but fails to improve the discrimination of consonants.
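
The abstract does not state the exact form of the second-order smoothness regularizer. As a rough illustration only, one common way to penalize non-smooth trajectories is the sum of squared second differences over time. The sketch below assumes that form. The function names and the toy trajectories are hypothetical and are not taken from the paper.

```python
import numpy as np


def second_difference_matrix(num_frames: int) -> np.ndarray:
    """Build the (num_frames - 2) x num_frames second-order difference operator."""
    if num_frames < 3:
        raise ValueError("need at least 3 frames for a second difference")
    D = np.zeros((num_frames - 2, num_frames))
    for t in range(num_frames - 2):
        # Each row computes x[t] - 2 * x[t + 1] + x[t + 2].
        D[t, t:t + 3] = [1.0, -2.0, 1.0]
    return D


def smoothness_penalty(trajectory: np.ndarray) -> float:
    """Sum of squared second differences of a (frames x dims) trajectory.

    Assumed penalty form; the paper's regularizer may differ.
    """
    D = second_difference_matrix(trajectory.shape[0])
    return float(np.sum((D @ trajectory) ** 2))


# Hypothetical 5-frame, 1-dimensional trajectories (not real MFCC data).
linear = np.arange(5, dtype=float).reshape(-1, 1)             # changes gradually
jagged = np.array([0.0, 1.0, 0.0, 1.0, 0.0]).reshape(-1, 1)   # oscillates

print(smoothness_penalty(linear))  # 0.0
print(smoothness_penalty(jagged))  # 12.0
```

Under this assumed penalty, a trajectory that changes at a constant rate costs nothing, while one that oscillates from frame to frame is penalized heavily. This is the kind of continuity preference the abstract describes.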
