首页> 外文会议>Annual conference of the International Speech Communication Association >Multi-System Fusion of Extended Context Prosodic and Cepstral Features for Paralinguistic Speaker Trait Classification
【24h】

Multi-System Fusion of Extended Context Prosodic and Cepstral Features for Paralinguistic Speaker Trait Classification

机译:扩展上下文韵律和倒谱特征的多系统融合,用于副语言说话人特质分类

获取原文

摘要

As automatic speech processing has matured, research atten tion has expanded to paralinguistic speech problems that aim to detect beyond-the-words information. This paper focuses on the identification of seven speaker trait categories from the Interspeech Speaker Trait Challenge: likeability, intelligibility, openness, conscientiousness, extraversion, agreeableness, and neuroticism. Our approach combines multiple features includ ing prosodic, cepstral, shifted-delta cepstral, and a reduced set of the OpenSMILE features. Our classification approaches in cluded GMM-UBM, eigenchannel, support vector machines, and distance based classifiers. Optimized feature reduction and logistic regression-based score calibration and fusion led to re sults that perform competitively against the challenge baseline in all categories.
机译:随着自动语音处理的成熟,研究重点已经扩展到旨在检测词外信息的副语言语音问题。本文着重从国际演讲者特质挑战中识别出七个说话者特质类别:相似性,可理解性,开放性,尽责性,外向性,顺从性和神经质。我们的方法结合了多种功能,包括韵律韵律,倒谱倒谱,倒数倒谱倒谱以及减少的OpenSMILE功能集。我们的分类方法包括GMM-UBM,特征通道,支持向量机和基于距离的分类器。优化的特征缩减以及基于逻辑回归的分数校准和融合导致结果在所有类别中均与挑战基准相比具有竞争优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号