TRAJECTORY CLUSTERING FOR AUTOMATIC SPEECH RECOGNITION

机译：自动语音识别的轨迹聚类

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we present an approach for automatic clustering of multi-dimensional dynamic trajectories corresponding to speech data that is based on Trajectory Clustering (TC). TC uses the Expectation Maximization algorithm (EM) for clustering with the mixtures of Multiple Linear Regression model. Since the initial values of the model parameters are critical to the clustering performance, a successive splitting algorithm was developed to incrementally increase the number of clusters. We define multipath HMM topologies using the trajectory clusters found. Based on the hypothesis that pronunciation variation in speech is more systematic at a unit level that is longer than a phone, we used modelling units defined in terms of Head-Body-Tail (HBT) models for connected digit recognition for the Dutch language. It appears that multi-path HMM topologies based on TC clusters outperform multi-path HMM topologies based on prior knowledge about speaker gender and speaking rate.

机译：在本文中，我们提出了一种基于轨迹聚类（TC）的与语音数据相对应的多维动态轨迹自动聚类的方法。 TC使用“期望最大化”算法（EM）与多重线性回归模型的混合物进行聚类。由于模型参数的初始值对于聚类性能至关重要，因此开发了一种连续的分割算法来逐步增加聚类数量。我们使用找到的轨迹簇定义多路径HMM拓扑。基于语音的语音变化在比电话更长的单位级别上更为系统化的假设，我们使用根据头尾（HBT）模型定义的建模单位来进行荷兰语的关联数字识别。似乎基于TC群集的多路径HMM拓扑优于基于说话者性别和发声率的先验知识的多路径HMM拓扑。

著录项

来源
《European Signal Processing Conference(EUSIPCO 2005); 20050904-08; Antalya(TK)》|2005年|P.1576-1579|共4页
会议地点 Antalya(TK)
作者
Yan Han; Johan de Veth; Louis Boves;
展开▼
作者单位

Center for Language and Speech Technology, Department of Language and Speech, Radboud University, Nijmegen, The Netherlands;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. Trajectory Clustering for Solving the Trajectory Folding Problem in Automatic Speech Recognition [J] . Han Y., Johan de Veth, Boves L. IEEE transactions on audio, speech and language processing . 2007,第4期

机译：轨迹聚类解决语音自动识别中的轨迹折叠问题
2. Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition [J] . Watanabe S., Sako A., Nakamura A. IEEE transactions on audio, speech and language processing . 2006,第3期

机译：基于变分贝叶斯估计和聚类的大词汇量连续语音识别自动确定声学模型拓扑
3. Automatic trajectory recognition in Active Target Time Projection Chambers data by means of hierarchical clustering [J] . Dalitz Christoph, Ayyad Yassid, Wilberg Jens, Computer physics communications . 2019,第期

机译：通过分层聚类自动轨迹识别活动目标时间投影室数据
4. Trajectory clustering for automatic speech recognition [C] . Yan Han, de Veth Johan, Boves Louis European Signal Processing Conference . 2005

机译：轨迹聚类自动语音识别
5. An automatic speech recognition oriented study on segmentation, low dimensional feature extraction, and temporal trajectory information capture. [D] . Zhu, Yonggang. 2002

机译：面向语音识别的自动研究，涉及分割，低维特征提取和时间轨迹信息捕获。
6. Brain-inspired speech segmentation for automatic speech recognition using the speech envelope as a temporal reference [O] . Byeongwook Lee, Kwang-Hyun Cho -1

机译：以语音包络作为时间参考的自动语音识别的大脑启发式语音分割
7. Trajectory Clustering for Solving the Trajectory Folding Problem in Automatic Speech Recognition [O] . Yan Han, Johan De Veth, Lou Boves 2008

机译：用于解决自动语音识别中轨迹折叠问题的轨迹聚类

TRAJECTORY CLUSTERING FOR AUTOMATIC SPEECH RECOGNITION

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅