首页> 外文会议>International Conference of Emerging Applications of Information Technology >Floating to Fixed-Point Translation with Its Application to Speech-Based Emotion Recognition
【24h】

Floating to Fixed-Point Translation with Its Application to Speech-Based Emotion Recognition

机译:漂浮到固定点转换与其在基于语音的情感识别中的应用

获取原文

摘要

Speech-based emotion recognition is one of the latest challenges in speech processing. The algorithms are developed using floating-point arithmetic because of its wide dynamic range and constant relative accuracy. However, they are finally implemented in hand held devices which are required to consume less power, time and have a lower market price. Fixed-point arithmetic with proper determination of integer and fractional bitwidths can help in satisfying these requirements. Therefore we have made an attempt to develop a fixed-point speech-based emotion recognition system using Mel frequency cepstral coefficients (MFCCs) and hidden Markov model (HMM). Accurate range and precision analysis has been carried out to compute optimum integer and fractional word lengths. The speech emotion engine has been evaluated using Berlin emotional speech database, EMO-DB. A speaker independent emotion recognition accuracy of 71.02% and 67.42% for floating-point and fixed-point formats with optimized wordlenghs respectively was achieved. Finite wordlength effect like quantization with range of relative errors and its effect on emotion recognition task has been analyzed.
机译:基于语音的情感识别是语音处理中的最新挑战之一。由于其宽动态范围和恒定的相对精度,使用浮点算术开发了算法。然而,它们最终在手持设备中实施,这些设备必须消耗更少的功率,时间并具有较低的市场价格。具有正确确定整数和分数位宽度的定点算术可以有助于满足这些要求。因此,我们已经尝试使用MEL频率谱系数(MFCC)和隐马尔可夫模型(HMM)开发一种基于固定点语音的情感识别系统。已经进行了准确的范围和精度分析以计算最佳整数和分数字长度。使用柏林情感语音数据库,emo-dB进行了评估语音情感引擎。达到漂浮点和定点格式的扬声器独立情绪识别准确度为71.02%和67.42%,分别具有优化的Wordlenghs。分析了具有相对误差范围的量化等有限字体长度效果及其对情感识别任务的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号