Conference: Annual Conference of the International Speech Communication Association

Confidence for Speaker Diarization using PCA Spectral Ratio



Abstract

Confidence scoring is an important component of speaker diarization systems, both for offline speech analytics and for online diarization, which must produce the speaker segmentation from very little audio. This paper proposes a confidence measure for speaker diarization based on the spectral ratio of the eigenvalues of the Principal Component Analysis (PCA) transformation, computed on the pre-segmented audio before diarization is performed on the conversation. We tested our method on two-speaker data, and our results show the effectiveness of the PCA spectral ratio confidence measure for both offline and online diarization. We compare and contrast our proposed confidence measure with other clustering validation methods, which provide a quantitative measure of segmentation quality but are calculated on the segmented data after diarization is performed, and with a related approach that extracts a confidence from the PCA of the pre-segmented audio.
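To make the idea of a PCA spectral ratio concrete, here is a minimal sketch in Python. The abstract does not give the exact formula, so this sketch assumes the ratio of the largest to the second-largest eigenvalue of the covariance of per-segment feature vectors. The function name `pca_spectral_ratio`, the feature dimension, and the synthetic "segments" are all hypothetical and are not taken from the paper.

```python
import numpy as np


def pca_spectral_ratio(segment_vectors: np.ndarray) -> float:
    """Assumed definition: largest PCA eigenvalue divided by the second largest.

    segment_vectors has shape (n_segments, n_features), one row per
    pre-segmented chunk of audio (hypothetical feature representation).
    """
    x = np.asarray(segment_vectors, dtype=float)
    if x.ndim != 2 or x.shape[0] < 3 or x.shape[1] < 2:
        raise ValueError("need at least 3 segments and 2 feature dimensions")

    # Center the data; PCA eigenvalues are the variances along principal axes.
    centered = x - x.mean(axis=0)

    # Singular values come back in descending order; squaring and scaling them
    # gives the eigenvalues of the sample covariance matrix.
    singular_values = np.linalg.svd(centered, compute_uv=False)
    eigenvalues = singular_values ** 2 / (x.shape[0] - 1)

    # Small constant guards against division by zero for degenerate data.
    return float(eigenvalues[0] / (eigenvalues[1] + 1e-12))


# Synthetic illustration only (not data from the paper).
rng = np.random.default_rng(0)
dim = 8
offset = np.zeros(dim)
offset[0] = 6.0  # the two synthetic "speakers" differ along one direction

two_speakers = np.vstack([
    rng.normal(size=(50, dim)) + offset,
    rng.normal(size=(50, dim)) - offset,
])
one_speaker = rng.normal(size=(100, dim))

ratio_two = pca_spectral_ratio(two_speakers)
ratio_one = pca_spectral_ratio(one_speaker)
print(f"well-separated speakers: {ratio_two:.2f}")
print(f"no clear separation:     {ratio_one:.2f}")
```

Under this assumed definition, a large ratio means one direction dominates the variance, which is what two well-separated speakers would tend to produce. The score is computed before any clustering, which matches the abstract's point that the measure is taken on the pre-segmented audio rather than on the diarization output.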
