【24h】

Learning intra-speaker model parameter correlations from many short speaker segments

机译:从许多简短的演讲者群体中学习演讲者内部模型参数的相关性

获取原文
获取原文并翻译 | 示例

摘要

Very rapid speaker adaptation algorithms, such as eigen-voices or speaker clustering, typically rely on learning intra-speaker correlations of model parameters from the training data. On the base of this a-priori knowledge, many model parameters can be successfully adapted on the basis of few observations. However, eigenvoice training or speaker clustering is non-trivial with training databases containing many short speaker segments, where for each speaker the available data to detect intra-speaker correlations is sparse. We have trained eigenvoices that yield a small but significant word error rate reduction in on-line adaptation (i.e. self adaptation) for a telephony database with on average only 5 seconds of speech per speaker in training and test data.
机译:非常快速的说话者自适应算法,例如特征语音或说话者聚类,通常依赖于从训练数据中学习模型参数的说话者内相关性。在此先验知识的基础上,可以基于少量观察成功地调整许多模型参数。然而,本征语音训练或说话者聚类对于包含许多简短说话者片段的训练数据库来说是不简单的,其中对于每个说话者而言,用于检测说话者内相关性的可用数据是稀疏的。我们已经训练了特征语音,该特征语音对于电话数据库的在线自适应(即自我自适应)产生了很小但明显的单词错误率降低,而在培训和测试数据中每个说话者平均只有5秒的语音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号