Modeling a Noisy-channel for Voice Conversion Using Articulatory Features

机译：使用发音特征对噪声通道建模以进行语音转换

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we propose modeling a noisy-channel for the task of voice conversion (VC). We have used the artificial neural networks (ANN) to capture speaker-specific characteristics of a target speaker which avoid the need for any training utterance from a source speaker. We use articulatory features (AFs) as a canonical form or speaker-independent representation of a speech signal. Our studies show that AFs contain a significant amount of speaker information in their trajectories. Suitable techniques are proposed to normalize the speaker-specific information in AF trajectories and the resultant AFs are used in voice conversion. The results of voice conversion evaluated using objective and subjective measures confirm that AFs can be used as a canonical form in nosiy-channel to capture speaker-specific characteristics of a target speaker.

机译：在本文中，我们建议为语音转换（VC）的任务建模一个噪声通道。我们已经使用人工神经网络（ANN）来捕获目标说话者的说话者特定特征，从而避免了源说话者的任何训练说话。我们使用发音特征（AF）作为语音信号的规范形式或与说话者无关的表示形式。我们的研究表明，AF在其轨迹中包含大量说话人信息。提出了合适的技术来标准化AF轨迹中的说话者特定信息，并且所得到的AF被用于语音转换。使用客观和主观措施评估的语音转换结果证实，AF可以用作嘈杂通道中的规范形式，以捕获目标说话者的特定说话者特征。

著录项

来源
《Annual conference of the International Speech Communication Association》|2012年|2199-2202|共4页
会议地点
作者
Bajibabu Bollepalli; Alan W Black; Kishore Prahallad;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
voice conversion; articulatory features; noisy-channel model; speaker-independent representation;

机译：语音转换;发音特征;噪声通道模型;独立于说话人的代表;

相似文献

外文文献
中文文献
专利

1. Mapping Articulatory-Features to Vocal-Tract Parameters for Voice Conversion [J] . Narpendyah Wisjnu ARIWARDHANI, Masashi KIMURA, Yurie IRIBE, IEICE transactions on information and systems . 2014,第4期

机译：将发音特征映射到人声参数以进行语音转换
2. Voice onset time versus articulatory modeling for stop consonants. [J] . Rothenberg M Logopedics, phoniatrics, vocology. . 2009,第4期

机译：语音起始时间与发音模型的终止辅音比较。
3. A pitch pattern modeling technique using dynamic features on the border of voiced and unvoiced segments [J] . Heiga Zen, Keiichi Tokuda, Takashi Masuko, 電子情報通信学会技術研究報告. 信号処理. Signal Processing . 2001,第323期

机译：在有声和无声段的边界上使用动态特征的音高模式建模技术
4. Modeling a Noisy-channel for Voice Conversion Using Articulatory Features [C] . Bajibabu Bollepalli, Alan W Black, Kishore Prahallad INTERSPEECH 2012 . 2012

机译：使用明晰度特征来模拟语音转换的嘈杂通道
5. Discriminative Articulatory Feature-based Pronunciation Models with Application to Spoken Term Detection [D] . Prabhavalkar, Rohit. 2013

机译：基于区分性发音特征的语音模型及其在口语检测中的应用
6. Affective Voice Interaction and Artificial Intelligence: A Research Study on the Acoustic Features of Gender and the Emotional States of the PAD Model [O] . Kuo-Liang Huang, Sheng-Feng Duan, Xi Lyu 2021

机译：情感语音互动与人工智能：性别声学特征的研究与垫模型的情绪状态
7. HIGH ACCURATE MODEL-INTEGRATION-BASED VOICE CONVERSION USING DYNAMIC FEATURES AND MODEL STRUCTURE OPTIMIZATION [O] . Daisuke Saito, Shinji Watanabe, Atsushi Nakamura, 2013

机译：基于动态特征和模型结构优化的高精度基于模型集成的语音转换

Modeling a Noisy-channel for Voice Conversion Using Articulatory Features

摘要

著录项

相似文献

相关主题

期刊订阅