Audio Features Selection for Automatic Height Estimation from Speech

Abstract

We address the automatic estimation of a person's height from speech by investigating the applicability of various subsets of speech features, formed by ranking numerous audio features according to their relevance and individual quality. Specifically, based on a relevance ranking of the large set of openSMILE audio descriptors, we selected subsets of different sizes and evaluated them on the height estimation task. In brief, during speech parameterization every input utterance is converted to a single feature vector of 6552 parameters. A subset of this feature vector is then fed to a support vector machine (SVM)-based regression model, which directly estimates the height of an unknown speaker. The experimental evaluation performed on the TIMIT database demonstrated that (i) the feature vector composed of the top-50 ranked parameters provides a good trade-off between computational demands and accuracy, and (ii) the best accuracy, in terms of mean absolute error and root mean square error, is observed for the top-200 subset.
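
The rank-then-select step and the two error measures named in the abstract can be illustrated with a minimal, standard-library-only sketch. The abstract does not state which relevance criterion was used for ranking, so the absolute Pearson correlation with height is an assumption made here purely for illustration. The function names (`rank_features`, `select_top_k`, `mae`, `rmse`) and the synthetic four-feature data are likewise hypothetical and stand in for the 6552 openSMILE parameters. The SVM regression stage is omitted.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)


def rank_features(X, y):
    """Return feature indices sorted by decreasing relevance.

    Relevance is |Pearson correlation| with the target: an assumed
    criterion, not necessarily the one used in the paper.
    """
    n_features = len(X[0])
    scores = [abs(pearson([row[j] for row in X], y)) for j in range(n_features)]
    return sorted(range(n_features), key=lambda j: scores[j], reverse=True)


def select_top_k(X, ranking, k):
    """Keep only the k highest-ranked columns of X, in ranking order."""
    keep = ranking[:k]
    return [[row[j] for j in keep] for row in X]


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean square error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


# Synthetic toy data: features 0 and 2 depend on height, features 1 and 3 are noise.
rng = random.Random(0)
heights = [rng.gauss(175.0, 8.0) for _ in range(40)]
X = [
    [0.5 * h + rng.gauss(0, 1), rng.gauss(0, 1), -h + rng.gauss(0, 2), rng.gauss(0, 1)]
    for h in heights
]

ranking = rank_features(X, heights)
X_top2 = select_top_k(X, ranking, 2)
print("ranking:", ranking)
print("reduced vector length:", len(X_top2[0]))
```

In a fuller pipeline the reduced vectors would be passed to a regressor (for example an SVM regressor such as scikit-learn's `SVR`, which is a tooling assumption and not confirmed by the source), and `mae` and `rmse` would be computed on held-out speakers for each subset size.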

Bibliographic details

Source: Hellenic Conference on Artificial Intelligence, 2010, 10 pages
Authors: Todor Ganchev; Iosif Mporas; Nikos Fakotakis
Original format: PDF
Chinese Library Classification: TP18-53
Keywords: Height estimation from speech; Speech parameterization; Feature ranking; Feature selection; SVM regression models

