首页> 外文会议>Proceedings of 2010 IEEE EMBS Conference on Biomedical Engineering and Sciences >Comparison of speaker normalization techniques for classification of emotionally disturbed subjects based on voice
【24h】

Comparison of speaker normalization techniques for classification of emotionally disturbed subjects based on voice

机译:基于语音对情感障碍对象进行分类的说话人归一化技术的比较

获取原文

摘要

When reviewing his clinical experience in treating suicidal patients, one of the authors observed that successful predictions of suicidality were often based on the patients voice independent of content. Research has shown that the Gaussian mixture model of the mel-cepstral features of speech can be used to distinguish the speech of suicidal persons from that of depressed and control persons with high classification rates. Since the vocal tract length vary from person to person, can the classification rates of suicidal persons be improved through speaker normalization? We approach this problem by warping the frequency axis of the mel-cepstral features. The results show that two different approaches yielded the best results: i) by using the maximum-likelihood approach in a gender-independent database to compute the warping factor for a nonlinear warp and ii) by a transformation of the first three formants in a gender-dependent database to compute the warping factor for a linear warp.
机译:在回顾他治疗自杀患者的临床经验时,一位作者观察到成功的自杀预测通常是基于患者的声音与内容无关的。研究表明,语音的梅斯倒谱特征的高斯混合模型可用于区分自杀者的语音与高分类率的抑郁者和控制者的语音。由于声道长度因人而异,因此可以通过说话人归一化来提高自杀者的分类率吗?我们通过扭曲mel倒谱特征的频率轴来解决此问题。结果表明,两种不同的方法产生了最佳结果:i)通过在性别无关的数据库中使用最大似然方法来计算非线性翘曲的翘曲因子,并且ii)通过转换性别中的前三个共振峰来进行依赖的数据库,以计算线性变形的变形因子。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号