首页> 外文会议>International Conference on Artificial Intelligence in Information and Communication >A New Unsupervised Short-Utterance based Speaker Identification Approach with Parametric t-SNE Dimensionality Reduction
【24h】

A New Unsupervised Short-Utterance based Speaker Identification Approach with Parametric t-SNE Dimensionality Reduction

机译:一种新的无监督基于短语的扬声器识别方法,具有参数T-SNE维数减少

获取原文

摘要

State-of-the-art speaker identification (SI) systems have achieved accuracies of 100% with long-duration utterances which are impractical. Recently, short-utterance based systems have gained attention although identification rates are lower. This paper presents an approach for text-dependent speaker phoneme-based (<1sec) SI with parametric t-distributed stochastic neighbor embedding (pt-SNE) for dimensionality reduction of features to provide 3D-visualization. The approach employs Gaussian mixture model enhanced by $K-means++$ and gap statistic methods. As there is no other similar work, a fair comparison is unavailable. The 75% rate achieved is comparable to other works using (i) short-utterances (ii) pt-SNE for recognition of other data types.
机译:最先进的扬声器识别(Si)系统已经实现了100 %的精度,长期发声是不切实际的。最近,虽然识别率较低,但基于短语的系统效率受到关注。本文介绍了一种基于文本依赖性扬声器音素的(<1SEC)SI的方法,具有参数T分布式随机邻居嵌入(PT-SNE),用于减少特征,以提供3D可视化。该方法采用高斯混合模型增强了$ k-means ++ $和缺口统计方法。由于没有其他类似的工作,公平的比较是不可用的。实现的75%的速率与使用(i)短语(ii)pt-sne的其他作品相当,用于识别其他数据类型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号