Journal of Experimental & Theoretical Artificial Intelligence

Hierarchical classifier design for speech emotion recognition in the mixed-cultural environment

Abstract

Recognizing emotion in speech is difficult because of speaker factors such as gender, age, and cultural background (nationality, ethnicity, and region), as well as the acoustic environment. Among these factors, the speaker's cultural background has a strong influence on how emotion is expressed. The unsatisfactory performance of an emotion recognition engine built using mixed-cultural samples can be traced back to this influence. To address this issue, a two-level hierarchical engine has been designed to identify emotion in speech from different cultural backgrounds. The first level of the hierarchical engine is a culture identification system, which identifies the corpus to which an input utterance belongs. Because most of the speakers recorded for a given corpus come from the same locality and cultural background, we assume that a corpus represents the cultural background of its speakers. Based on the response of the first-level classifier, the input utterance is forwarded to the appropriate corpus-specific emotion recognition engine in the second level. Each corpus-specific emotion recognition system is a discriminative, multiclass SVM classifier trained on the emotional utterances of that particular corpus. The system has been tested with five corpora collected from diverse cultural backgrounds: EMO-DB, SAVEE, IITKGP-SEC, the Spanish corpus S0329, and CMU's Woogles corpus. The system achieved an accuracy of 82.01%, an improvement of 13.38% over monolithic approaches.
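The two-level routing described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract specifies multiclass SVMs at the second level, while the code below swaps in a tiny nearest-centroid classifier so that it runs with the standard library alone. The class names, the corpus labels "A" and "B", and the two-dimensional feature vectors are all hypothetical. The toy data is built so that the same acoustic region means different emotions in the two corpora, which is the situation the hierarchical design is meant to handle.

```python
from collections import defaultdict
from math import dist


class NearestCentroid:
    """Stand-in classifier (an assumption, NOT the paper's SVM):
    predicts the label whose class mean is closest to the input."""

    def fit(self, X, y):
        groups = defaultdict(list)
        for features, label in zip(X, y):
            groups[label].append(features)
        # Mean feature vector per class.
        self.centroids_ = {
            label: [sum(col) / len(rows) for col in zip(*rows)]
            for label, rows in groups.items()
        }
        return self

    def predict_one(self, features):
        return min(self.centroids_,
                   key=lambda label: dist(features, self.centroids_[label]))


class HierarchicalEmotionRecognizer:
    """Level 1 identifies the corpus; level 2 applies that corpus's emotion model."""

    def __init__(self, make_classifier=NearestCentroid):
        self.make_classifier = make_classifier

    def fit(self, X, corpus_labels, emotion_labels):
        # Level 1: corpus (culture) identifier trained on all utterances.
        self.corpus_model_ = self.make_classifier().fit(X, corpus_labels)
        # Level 2: one emotion model per corpus, trained only on that corpus.
        by_corpus = defaultdict(lambda: ([], []))
        for features, corpus, emotion in zip(X, corpus_labels, emotion_labels):
            by_corpus[corpus][0].append(features)
            by_corpus[corpus][1].append(emotion)
        self.emotion_models_ = {
            corpus: self.make_classifier().fit(feats, emos)
            for corpus, (feats, emos) in by_corpus.items()
        }
        return self

    def predict_one(self, features):
        # Route the utterance to the emotion model of the predicted corpus.
        corpus = self.corpus_model_.predict_one(features)
        emotion = self.emotion_models_[corpus].predict_one(features)
        return corpus, emotion


# Hypothetical toy features: the first value separates the corpora, the second
# separates emotions, with the emotion mapping reversed between corpora.
X = [[0, 0], [1, 0], [0, 5], [1, 5], [10, 5], [11, 5], [10, 0], [11, 0]]
corpora = ["A", "A", "A", "A", "B", "B", "B", "B"]
emotions = ["angry", "angry", "sad", "sad", "angry", "angry", "sad", "sad"]

model = HierarchicalEmotionRecognizer().fit(X, corpora, emotions)
print(model.predict_one([0.5, 0.2]))
print(model.predict_one([10.2, 0.3]))
```

Because `make_classifier` is just a factory for any object with `fit` and `predict_one`, an SVM wrapper could be substituted for the stand-in without changing the routing logic.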
