首页> 外文会议>Proceeding of the First international conference on multimodal interfaces(ICMI'96) >Text-Independent Speaker Identification Using Average Spectrum And GMM Approaches
【24h】

Text-Independent Speaker Identification Using Average Spectrum And GMM Approaches

机译:使用平均频谱和GMM方法的与文本无关的说话人识别

获取原文
获取原文并翻译 | 示例

摘要

A text-independent speaker identification system is implemented in this paper. Average FFT spectrum feature is used as the first approach to identify different speakers, because of its relatively good performance and low cost. Then, Gaussian Mixture Model(GMM) approach is used to model the utterance more precisely and to reduce the noise affections. The solving algorithm of the two approaches are given, and some important factors affecting the efficiency and reliability of the system are focused on, such as the decision threshold, the preemphasis coefficient, GMM model order, utterance length for testing and training.
机译:本文实现了一种与文本无关的说话人识别系统。由于平均FFT频谱功能相对较好的性能和较低的成本,因此它被用作识别不同扬声器的第一种方法。然后,使用高斯混合模型(GMM)方法对发声进行更精确的建模并减少噪声影响。给出了两种方法的求解算法,重点讨论了影响系统效率和可靠性的重要因素,例如决策阈值,预加重系数,GMM模型阶数,测试和训练的发声长度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号