Exploring Discriminative Speech Trajectory Structures

机译：探索区分性语音轨迹结构

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The articulators of the human speech production mechanism can only move relatively sluggishly. This results in speech sounds of which the acoustic speech properties mostly change continuously and gradually over time. However, such continuity constraints are seldom exploited for the purpose of discriminating different phones. In order to explore to what extent incorporating continuity information can help to improve phone discrimination, we investigated a multi-frame MFCC representation in combination with a supervised dimensionality reduction method which aims at finding a low-dimensional representation that best separates the different phones. The speech continuity information is encoded by a second-order smoothness regular-izer. Experimental results on TIMIT phone classification show that the regularizer is helpful in better distinguishing vowels, but fails to improve the discrimination of consonants.

机译：人类语音产生机制的发音器只能相对缓慢地运动。这导致语音，其语音特性大多随时间连续地且逐渐地变化。但是，很少使用这种连续性约束来区分不同的电话。为了探索在何种程度上合并连续性信息可以帮助改善电话识别度，我们研究了多帧MFCC表示与有监督的降维方法相结合的方法，该方法旨在找到最能区分不同手机的低维表示。语音连续性信息由二阶平滑度调节器编码。 TIMIT电话分类的实验结果表明，正则化器有助于更好地区分元音，但无法改善辅音的辨别力。

著录项

来源
《Annual conference of the International Speech Communication Association》|2012年|1794-1797|共4页
会议地点
作者
Heyun Huang; Louis ten Bosch; Bert Cranen; Lou Boves;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Dimensionality Reduction; Contextual Representation; TIMIT; regularization; Laplacian smoothing;

机译：降维;上下文表示; TIMIT;正规化;拉普拉斯平滑;

相似文献

外文文献
中文文献
专利

1. 汉语语音识别中区分性声调模型及最优集成方法 [J] . 黄浩, 朱杰东南大学学报（英文版） . 2007,第002期
2. Discriminative semi-parametric trajectory model for speech recognition [J] . K.C. Sim, M.J.F. Gales Computer speech and language . 2007,第4期

机译：语音识别的区分性半参数轨迹模型
3. Structured Discriminative Models For Speech Recognition: An Overview [J] . Gales M.J.F., Watanabe S., Fosler-Lussier E. Signal Processing Magazine, IEEE . 2012,第6期

机译：语音识别的结构化判别模型：概述
4. A Diagnostic Marker to Discriminate Childhood Apraxia of Speech From Speech Delay: III. Theoretical Coherence of the Pause Marker with Speech Processing Deficits in Childhood Apraxia of Speech [J] . Shriberg Lawrence D., Strand Edythe A., Fourakis Marios, Journal of speech, language, and hearing research: JSLHR . 2017,第4期

机译：诊断标记，以辨别童年的言论言语延迟：III。暂停标记与语音处理缺陷的暂停标记
5. Exploring Discriminative Speech Trajectory Structures [C] . Heyun Huang, Louis ten Bosch, Bert Cranen, INTERSPEECH 2012 . 2012

机译：探索鉴别性语音轨迹结构
6. Exploring Shared Structure Among Vehicle Trajectories [D] . ?Chen, Chen 2016

机译：探索在车辆轨迹共享结构
7. A Diagnostic Marker to Discriminate Childhood Apraxia of Speech From Speech Delay: III. Theoretical Coherence of the Pause Marker with Speech Processing Deficits in Childhood Apraxia of Speech [O] . Lawrence D. Shriberg, Edythe A. Strand, Marios Fourakis, -1

机译：从言语延迟区分儿童言语失用的诊断标记：III。儿童言语失用的暂停标记与语音处理缺陷的理论相干性
8. Action Recognition Using Discriminative Structured Trajectory Groups [O] . Atmosukarto Indriyati, Ahuja Narendra, Ghanem Bernard 2015

机译：使用区分结构轨迹组的动作识别
9. Exploring Speech-Enabled Dialogue with the Galaxy Communicator Infrastructure [R] . Bayer, S. , Doran, C. , George, B. 2001

机译：探索与Galaxy Communicator基础设施进行的语音对话

Exploring Discriminative Speech Trajectory Structures

摘要

著录项

相似文献

相关主题

期刊订阅