An estimation method of voice timbre evaluation values using feature extraction with Gaussian mixture model based on reference singer



Abstract

This paper presents a method for estimating voice timbre evaluation values for the singing voices of arbitrary singers generated with a singing voice synthesis system, as a step toward the development of a singing voice retrieval system. Voice timbre evaluation values are numerical values corresponding to voice timbre expression words, such as "Age" and "Gender", and they usually have to be assigned manually to each singer's singing voice through listening. To estimate them automatically from a given singer's singing voice, an acoustic feature that captures only each singer's voice timbre is extracted with a Gaussian mixture model trained on parallel data between singing voices sung by many pre-stored target singers and the same voices sung by a reference singer. The voice timbre evaluation values are then estimated from the extracted feature using regression models. The experimental results showed that the proposed method can accurately estimate those values for some expression words, such as "Age" and "Gender", and that nonlinear regression is effective for the expression words "Powerfulness" and "Uniqueness".
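The final step described in the abstract, mapping an extracted timbre feature to an evaluation value with a linear or a nonlinear regression model, can be illustrated with a minimal sketch. The abstract does not name the regression models used, so ridge regression and RBF kernel ridge regression are stand-ins chosen here for illustration. The feature vectors and scores are synthetic, and all names (`features`, `scores_linear_word`, `scores_nonlinear_word`, `GAMMA`) are hypothetical. The sketch does not reproduce the paper's features, data, or results.

```python
import numpy as np

GAMMA = 1.0  # RBF kernel width (assumed value, not from the paper)


def add_bias(X):
    """Append a constant column so the linear model can learn an offset."""
    return np.hstack([X, np.ones((X.shape[0], 1))])


def fit_linear(X, y, reg=1e-3):
    """Ridge-regularised linear regression; returns the weight vector."""
    Xb = add_bias(X)
    return np.linalg.solve(Xb.T @ Xb + reg * np.eye(Xb.shape[1]), Xb.T @ y)


def predict_linear(w, X):
    return add_bias(X) @ w


def rbf_kernel(A, B, gamma=GAMMA):
    """Gaussian (RBF) kernel matrix between the rows of A and the rows of B."""
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))


def fit_kernel_ridge(X, y, reg=1e-2):
    """Kernel ridge regression (a nonlinear regressor); returns dual weights."""
    return np.linalg.solve(rbf_kernel(X, X) + reg * np.eye(len(X)), y)


def predict_kernel_ridge(alpha, X_train, X_new):
    return rbf_kernel(X_new, X_train) @ alpha


def rmse(a, b):
    return float(np.sqrt(np.mean((a - b) ** 2)))


# Synthetic stand-in for per-singer timbre features (2-D for simplicity).
rng = np.random.default_rng(0)
features = rng.uniform(-2.0, 2.0, size=(200, 2))
noise = 0.05 * rng.standard_normal(200)

# Two hypothetical expression words: one whose score depends linearly on the
# feature, and one whose score depends on it nonlinearly.
scores_linear_word = 3.0 * features[:, 0] - features[:, 1] + noise
scores_nonlinear_word = np.sin(2.0 * features[:, 0]) + features[:, 1] ** 2 + noise

train, test = slice(0, 150), slice(150, 200)
results = {}
for name, y in [("linear_word", scores_linear_word),
                ("nonlinear_word", scores_nonlinear_word)]:
    w = fit_linear(features[train], y[train])
    alpha = fit_kernel_ridge(features[train], y[train])
    results[name] = {
        "linear": rmse(predict_linear(w, features[test]), y[test]),
        "kernel": rmse(
            predict_kernel_ridge(alpha, features[train], features[test]),
            y[test],
        ),
    }

for name, errs in results.items():
    print(name, {k: round(v, 3) for k, v in errs.items()})
```

On data of this kind, a linear model is adequate when the score varies linearly with the feature, while the kernel model has lower held-out error when the relationship is nonlinear. That is the general situation in which a nonlinear regressor would be preferred for a particular expression word.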


Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing, 2016, pp. 5265-5269 (5 pages)
Authors
Soichi Yamane; Kazuhiro Kobayashi; Tomoki Toda; Tomoyasu Nakano; Masataka Goto; Satoshi Nakamura
Original format
PDF
Keywords
Gaussian mixture model; estimation of evaluation values; reference singer; singing voice synthesis; voice timbre


