首页> 外文期刊>Consumer Electronics, IEEE Transactions on >GMM adaptation based online speaker segmentation for spoken document retrieval
【24h】

GMM adaptation based online speaker segmentation for spoken document retrieval

机译:基于GMM自适应的在线说话人分割,用于语音文档检索

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes an online speaker segmentation approach based on Gaussian Mixture Model (GMM) adaptation for spoken document retrieval. In the conventional approach using the Bayesian Information Criterion (BIC), two single Gaussian models are respectively constructed for two divided speech streams in an analysis window, and the dissimilarity between the two models is estimated according to the BIC principle. This approach has been widely applied to speaker segmentation. However, its performance may deteriorate when speakers change frequently, since the single Gaussian model hardly represent the speaker's explicit characteristics for short speech data. To overcome this limitation, we propose an approach to use adapted GMMs instead of single Gaussian models. The method proposed herein constructs a local UBM for speech in an analysis window and adapts the local UBM to each of two divided speech streams in the same window. Upon the two adapted GMMs obtained from the adaptation, the likelihood of the respective speech stream is estimated and change of speaker is determined according to our criterion based on local maxima of BIC. On speaker segmentation experiments based on HUB4, a well-known broadcast news corpus, the proposed method exhibited superior performance compared to the conventional approaches.
机译:本文提出了一种基于高斯混合模型(GMM)自适应的在线说话者分割方法,用于语音文档检索。在使用贝叶斯信息准则(BIC)的常规方法中,分别在分析窗口中为两个划分的语音流构建两个单一的高斯模型,并根据BIC原理估计两个模型之间的相异性。该方法已被广泛应用于说话人细分。但是,由于单个高斯模型几乎不能代表说话者对短语音数据的显式特征,因此当说话者频繁更换时,其性能可能会下降。为了克服此限制,我们提出了一种使用自适应GMM而不是单个高斯模型的方法。本文提出的方法在分析窗口中构造用于语音的本地UBM,并使该本地UBM适应同一窗口中的两个划分的语音流中的每一个。基于从自适应获得的两个自适应GMM,估计各自语音流的可能性,并根据我们基于BIC的局部最大值的标准确定说话者的变化。在基于知名广播新闻语料库HUB4的说话人分割实验中,与传统方法相比,该方法具有更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号