首页> 外文会议>Hellenic Conference on Artificial Intelligence >Audio Features Selection for Automatic Height Estimation from Speech
【24h】

Audio Features Selection for Automatic Height Estimation from Speech

机译:音频功能选择用于语音的自动高度估计

获取原文

摘要

Aiming at the automatic estimation of the height of a person from speech, we investigate the applicability of various subsets of speech features, which were formed on the basis of ranking the relevance and the individual quality of numerous audio features. Specifically, based on the relevance ranking of the large set of openSMILE audio descriptors, we performed selection of subsets with different sizes and evaluated them on the height estimation task. In brief, during the speech parameterization process, every input utterance is converted to a single feature vector, which consists of 6552 parameters. Next, a subset of this feature vector is fed to a support vector machine (SVM)-based regression model, which aims at the straight estimation of the height of an unknown speaker. The experimental evaluation performed on the TIMIT database demonstrated that: (i) the feature vector composed of the top-50 ranked parameters provides a good trade-off between computational demands and accuracy, and that (ii) the best accuracy, in terms of mean absolute error and root mean square error, is observed for the top-200 subset.
机译:旨在自动估计来自语音的人的高度,我们调查了语音特征的各种子集的适用性,这是基于排名的相关性和许多音频特征的个人质量而形成。具体地,基于大集合的开放式音频描述符的相关性排序,我们执行了具有不同大小的子集的选择,并在高度估计任务上进行评估。简而言之,在语音参数化过程中,每个输入话语被转换为单个特征向量,该传感器由6552个参数组成。接下来,将该特征向量的子集馈送到基于支持向量机(SVM)的回归模型,其旨在直接估计未知扬声器的高度。在Timit数据库上执行的实验评估表明:(i)由前50个排名参数组成的特征向量在计算需求和准确性之间提供了良好的权衡,并且(ii)在平均值方面是最佳准确性对于前200个子集,观察到绝对误差和根均方误差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号