【24h】

Speaker adaptation for telephony data using speaker clustering

机译:使用扬声器群集对电话数据进行扬声器适应

获取原文

摘要

This paper reports an ongoing effort to develop an unsupervised on-line speaker adaptation method for telephony environment. All speakers in the training data corpus are acoustically pre-clustered into clusters, and a cluster-dependent system is built for each cluster. When a new telephony test speaker is given, a cluster, which is the closest to the speaker, is determined and selected by an improved distance measure. Based on this selected cluster, MLLR adaptation algorithm with block diagonal transformation is applied to move the cluster model to be closer to the testing speaker. For telephony application the adaptation data can be very short or noisy, potentially, the MLLR adapted means can be unreliable. A MAP-like weighting scheme for MLLR adaptation is applied to insure the adapted mean realiable when the adaptation data is very short.
机译:本文报告了正在进行的努力,以开发一种用于电话环境的无监督在线扬声器自适应方法。训练数据语料库中的所有说话者都经过声学预聚类,并且为每个聚类构建了一个与聚类相关的系统。当给出新的电话测试扬声器时,通过改进的距离度量来确定和选择最接近扬声器的群集。基于此选定的群集,采用具有块对角线变换的MLLR自适应算法将群集模型移动到更靠近测试说话者的位置。对于电话应用,适配数据可能非常短或嘈杂,潜在地,MLLR适配的装置可能是不可靠的。适用于MLLR适应的MAP类加权方案可确保在适应数据非常短时可实现所适应的均值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号