首页> 外文会议>Annual conference of the International Speech Communication Association >Multi-System Fusion of Extended Context Prosodic and Cepstral Features for Paralinguistic Speaker Trait Classification
【24h】

Multi-System Fusion of Extended Context Prosodic and Cepstral Features for Paralinguistic Speaker Trait Classification

机译:延长上下文博物馆的多系统融合,抗肌肌肌剖面,PalAl语言扬声器特质分类

获取原文

摘要

As automatic speech processing has matured, research atten tion has expanded to paralinguistic speech problems that aim to detect beyond-the-words information. This paper focuses on the identification of seven speaker trait categories from the Interspeech Speaker Trait Challenge: likeability, intelligibility, openness, conscientiousness, extraversion, agreeableness, and neuroticism. Our approach combines multiple features includ ing prosodic, cepstral, shifted-delta cepstral, and a reduced set of the OpenSMILE features. Our classification approaches in cluded GMM-UBM, eigenchannel, support vector machines, and distance based classifiers. Optimized feature reduction and logistic regression-based score calibration and fusion led to re sults that perform competitively against the challenge baseline in all categories.
机译:随着自动语音处理已经成熟,研究效果扩展到Paralinguistic言语问题,该问题旨在检测单词信息。本文侧重于识别七位扬声器特质类别,从三个扬声器特征挑战挑战:可爱,可懂度,可懂度,开放性,尽职苛求性,外向,令人满意和神经质。我们的方法结合了多种特征,包括韵律,颅骨,移位 - δ颅脂素,以及一组减少的开放式功能。我们在Cluded GMM-UBM,Eigenchannel,支持向量机和基于距离的分类器中的分类方法。优化的特征减少和基于Logistic回归的分数校准和融合导致了竞争地对所有类别中的挑战基线进行竞争性的调整。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号