首页> 外文期刊>IEEE Transactions on Speech and Audio Proceessing >Multigrained modeling with pattern specific maximum likelihood transformations for text-independent speaker recognition
【24h】

Multigrained modeling with pattern specific maximum likelihood transformations for text-independent speaker recognition

机译:具有模式特定最大似然变换的多颗粒建模,用于与文本无关的说话人识别

获取原文
获取原文并翻译 | 示例
       

摘要

We present a transformation-based, multigrained data modeling technique in the context of text independent speaker recognition, aimed at mitigating difficulties caused by sparse training and test data. Both identification and verification are addressed, where we view the entire population as divided into the target population and its complement, which we refer to as the background population. First, we present our development of maximum likelihood transformation based recognition with diagonally constrained Gaussian mixture models and show its robustness to data scarcity with results on identification. Then for each target and background speaker, a multigrained model is constructed using the transformation based extension as a building block. The training data is labeled with an HMM based phone labeler. We then make use of a graduated phone class structure to train the speaker model at various levels of detail. This structure is a tree with the root node containing all the phones. Subsequent levels partition the phones into increasingly finer grained linguistic classes. This method affords the use of fine detail where possible, i.e., as reflected in the amount of training data distributed to each tree node. We demonstrate the effectiveness of the modeling with verification experiments in matched and mismatched conditions.
机译:在文本独立的说话人识别的背景下,我们提出了一种基于变换的,多粒度的数据建模技术,旨在缓解因稀疏训练和测试数据而造成的困难。识别和验证都得到了解决,在这里我们将整个人口分为目标人口及其补充,我们将其称为背景人口。首先,我们介绍了基于对角约束高斯混合模型的基于最大似然变换的识别技术的发展,并通过识别结果显示了其对数据稀缺性的鲁棒性。然后,对于每个目标说话者和背景说话者,使用基于变换的扩展作为构建块来构建多粒度模型。培训数据使用基于HMM的电话标签器进行标签。然后,我们使用分级的电话班级结构来训练扬声器模型的各个细节级别。该结构是一个树,其根节点包含所有电话。随后的级别将电话分为越来越精细的语言类别。该方法在可能的情况下提供了精细的细节,即反映在分配给每个树节点的训练数据量中。我们通过匹配和不匹配条件下的验证实验证明了建模的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号