【24h】

TRAJECTORY CLUSTERING FOR AUTOMATIC SPEECH RECOGNITION

机译:自动语音识别的轨迹聚类

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we present an approach for automatic clustering of multi-dimensional dynamic trajectories corresponding to speech data that is based on Trajectory Clustering (TC). TC uses the Expectation Maximization algorithm (EM) for clustering with the mixtures of Multiple Linear Regression model. Since the initial values of the model parameters are critical to the clustering performance, a successive splitting algorithm was developed to incrementally increase the number of clusters. We define multipath HMM topologies using the trajectory clusters found. Based on the hypothesis that pronunciation variation in speech is more systematic at a unit level that is longer than a phone, we used modelling units defined in terms of Head-Body-Tail (HBT) models for connected digit recognition for the Dutch language. It appears that multi-path HMM topologies based on TC clusters outperform multi-path HMM topologies based on prior knowledge about speaker gender and speaking rate.
机译:在本文中,我们提出了一种基于轨迹聚类(TC)的与语音数据相对应的多维动态轨迹自动聚类的方法。 TC使用“期望最大化”算法(EM)与多重线性回归模型的混合物进行聚类。由于模型参数的初始值对于聚类性能至关重要,因此开发了一种连续的分割算法来逐步增加聚类数量。我们使用找到的轨迹簇定义多路径HMM拓扑。基于语音的语音变化在比电话更长的单位级别上更为系统化的假设,我们使用根据头尾(HBT)模型定义的建模单位来进行荷兰语的关联数字识别。似乎基于TC群集的多路径HMM拓扑优于基于说话者性别和发声率的先验知识的多路径HMM拓扑。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号