Artificial stereo data generation for speech feature mapping



Abstract

Feature mapping is widely used to eliminate the mismatch between the training and test conditions in speech recognition. In feature mapping, a target (mismatched) feature vector sequence is mapped closer to the corresponding reference (matched) feature vector sequence. The mapping system is usually trained on a set of stereo data, which consists of simultaneous recordings obtained in both the reference and target conditions. In this paper, we propose a novel approach to blind parameter estimation that does not require the reference feature vectors. The proposed approach is motivated by the hidden Markov model (HMM)-based speech synthesis algorithm.
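As a rough illustration of the conventional stereo-data training that the abstract contrasts against (not the blind estimation method proposed in the paper), the following minimal sketch fits a per-dimension affine map from target features to reference features by ordinary least squares. The function names, the diagonal affine form of the mapping, and the toy data are all assumptions made for illustration; the abstract does not specify the form of the mapping.

```python
from typing import List, Sequence, Tuple

Frame = Sequence[float]


def fit_diagonal_affine_map(
    target: Sequence[Frame], reference: Sequence[Frame]
) -> List[Tuple[float, float]]:
    """Fit reference[d] ~ a * target[d] + b for each dimension d.

    `target` and `reference` are frame-aligned stereo data: frame i of
    each list is assumed to come from the same instant of the same
    utterance, recorded under the two conditions.
    """
    if not target or len(target) != len(reference):
        raise ValueError("stereo data must be non-empty, frame-aligned pairs")
    n = len(target)
    dim = len(target[0])
    params = []
    for d in range(dim):
        x = [frame[d] for frame in target]
        y = [frame[d] for frame in reference]
        mean_x = sum(x) / n
        mean_y = sum(y) / n
        var_x = sum((xi - mean_x) ** 2 for xi in x)
        cov_xy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
        # Fall back to a pure bias correction if the dimension is constant.
        a = cov_xy / var_x if var_x > 0 else 1.0
        b = mean_y - a * mean_x
        params.append((a, b))
    return params


def apply_map(
    params: Sequence[Tuple[float, float]], frames: Sequence[Frame]
) -> List[List[float]]:
    """Map target-condition frames toward the reference condition."""
    return [[a * v + b for (a, b), v in zip(params, frame)] for frame in frames]


# Toy stereo data (hypothetical): the target condition scales and shifts
# each reference feature dimension.
reference_frames = [[0.0, 1.0], [1.0, 3.0], [2.0, 5.0]]
target_frames = [[2.0 * r0 + 1.0, r1 - 0.5] for r0, r1 in reference_frames]

mapping = fit_diagonal_affine_map(target_frames, reference_frames)
mapped_frames = apply_map(mapping, target_frames)
```

On this toy data the distortion is itself affine per dimension, so the fitted map recovers the reference frames. Note that the fit needs `reference_frames`, which is exactly the requirement the paper's blind approach aims to remove.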


Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012, pp. 4897-4900 (4 pages)
Conference location: Kyoto (JP)
Authors
Han, Chang Woo
Affiliation
School of Electrical Engineering and INMC, Seoul National University, 151-742, Korea
Original format: PDF

Similar literature


1. Single Channel Dereverberation by Feature Mapping Using Limited Stereo Data [J]. Aditya Arie NUGRAHA, Kazumasa YAMAMOTO, Seiichi NAKAGAWA. 電子情報通信学会技術研究報告. 音声. Speech. 2013, No. 161

2. Efficient online target speech extraction using DOA-constrained independent component analysis of stereo data for robust speech recognition [J]. Minook Kim, Hyung-Min Park. Signal Processing. 2015, No. deca

3. Speech compensation using Stereo Based Stochastic Vector Mapping based on Full Covariance Models [J]. Randa Al-Wakeel, Mahmoud Shoman, Magdy Aboul-Ela. International Journal of Information and Communication Technology Research. 2014, No. 8

4. Artificial stereo data generation for speech feature mapping [C] . Chang Woo Han, Tae Gyoon Kang, Shin Jae Kang, IEEE International Conference on Acoustics, Speech and Signal Processing . 2011

5. DTM generation from digitized aerial photos of a complex scene by employing pattern recognition with the Fourier transform and multiresolutional feature-based stereo matching. [D] . Huang, Yishuo. 1996

6. Nematode.net update 2011: addition of data sets and tools featuring next-generation sequencing data [O] . John Martin, Sahar Abubucker, Esley Heizer, 2012

7. GeoNat v1.0: A dataset for natural feature mapping with artificial intelligence and supervised learning [O] . Samantha T. Arundel, Wenwen Li, Sizhe Wang 2020

8. Photogrammetric GIS Technology: Feature Mapping on Digital Stereo Imagery [R] . Brown, R. O. 1991

