【24h】

Hunting for Wolves in Speaker Recognition

机译:寻找说话者识别中的狼

获取原文
获取原文并翻译 | 示例

摘要

Identification and selection of speaker pairs that are difficult to distinguish offers the possibility of better focusing speaker recognition research, while also reducing the amount of data needed to estimate system performance with confidence. This work aims to predict which speaker pairs will be difficult for automatic speaker recognition systems to distinguish, by using features that characterize speakers, and thus provide a measure of speaker similarity. Features tested include pitch, jitter, shimmer, formant frequencies, energy, long term average spectrum energy, histograms of frequencies from roots of LPC coefficients, and spectral slope. Absolute and percent differences, Euclidean distance, and correlation coefficients are utilized to measure the closeness of these speaker features. Using data from NIST's 2008 Speaker Recognition Evaluation, the largest changes in detection cost and false alarm rate for similar speaker pairs (relative to all speaker pairs) occurs when speaker pairs are selected using the Euclidean distance between vectors of the mean first, second, and third formant frequencies. Even bigger differences in performance occur when speaker pairs are selected using the KL divergence between speaker-specific GMMs as a measure of similarity. In general, the feature-measures considered here are more successful at finding easy-to-distinguish speaker pairs than difficult-to-distinguish ones, and can provide potentially useful information about a speaker's tendency to be similar or dissimilar to other speakers.
机译:难以区分的说话人对的识别和选择为更好地集中说话人识别研究提供了可能性,同时也减少了信心评估系统性能所需的数据量。这项工作旨在通过使用表征说话人的特征来预测自动说话人识别系统难以区分哪些说话人对,从而提供说话人相似性的度量。测试的功能包括音调,抖动,闪烁,共振峰频率,能量,长期平均频谱能量,LPC系数根源的频率直方图和频谱斜率。绝对差和百分比差,欧几里得距离和相关系数用于测量这些扬声器特征的接近度。使用NIST 2008说话者识别评估中的数据,当使用平均第一,第二和第二矢量之间的欧几里德距离选择说话者对时,相似说话者对(相对于所有说话者对)的检测成本和误报率变化最大。第三共振峰频率。当使用特定于扬声器的GMM之间的KL差异来选择扬声器对时,在性能上会出现更大的差异,以作为相似度的度量。通常,此处考虑的特征量度比难区分​​的对更能找到易于区分的对,并且可以提供有关说话者与其他说话者相似或不相似的趋势的潜在有用信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号