Missing-Feature-Theory-based Robust Simultaneous Speech Recognition System with Non-clean Speech Acoustic Model

机译：基于缺失的特征理论的强大立即语音识别系统，具有非清洁语音声学模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A humanoid robot must recognize a target speech signal while people around the robot chat with them in real-world. To recognize the target speech signal, robot has to separate the target speech signal among other speech signals and recognize the separated speech signal. As separated signal includes distortion, automatic speech recognition (ASR) performance degrades. To avoid the degradation, we trained an acoustic model from non-clean speech signals to adapt acoustic feature of distorted signal and adding white noise to separated speech signal before extracting acoustic feature. The issues are (1) To determine optimal noise level to add the training speech signals, and (2) To determine optimal noise level to add the separated signal. In this paper, we investigate how much noises should be added to clean speech data for training and how speech recognition performance improves for different positions of three talkers with soft masking. Experimental results show that the best performance is obtained by adding white noises of 30 dB. The ASR with the acoustic model outperforms with ASR with the clean acoustic model by 4 points.

机译：人形机器人必须识别目标语音信号，而机器人周围的人与他们在现实世界中聊天。为了识别目标语音信号，机器人必须在其他语音信号中分离目标语音信号并识别分离的语音信号。由于分离信号包括失真，自动语音识别（ASR）性能下降。为避免降级，我们从非清洁语音信号训练了声学模型，以使失真信号的声学特征调节，并在提取声学特征之前将白噪声添加到分离的语音信号。问题是（1）确定最佳噪声水平，以添加训练语音信号，（2）以确定最佳噪声水平以添加分离信号。在本文中，我们调查了应对清洁语音数据进行培训的噪音以及语音识别性能如何改善三个讲话者的不同掩码的不同位置。实验结果表明，通过添加30 dB的白色噪声获得了最佳性能。与声学模型的ASR与ASR具有4分的清洁声学模型。

著录项

来源
《International Conference on Intelligent Robotics and Systems》|2009年||共6页
会议地点
作者
Toru Takahashi; Kazuhiro Nakadai; Kazunori Komatani; Tetsuya Ogata; Hiroshi G. Okuno;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP242.6-53;
关键词
入库时间 2022-08-20 21:29:34

相似文献

外文文献
中文文献
专利

1. An End-to-End Deep Learning Approach to Simultaneous Speech Dereverberation and Acoustic Modeling for Robust Speech Recognition [J] . Bo Wu, Kehuang Li, Fengpei Ge, Selected Topics in Signal Processing, IEEE Journal of . 2017,第8期

机译：端到端深度学习方法可同时进行语音去混响和声学建模，以实现可靠的语音识别
2. A Speaker-Dependent Approach to Single-Channel Joint Speech Separation and Acoustic Modeling Based on Deep Neural Networks for Robust Recognition of Multi-Talker Speech [J] . Yan-Hui Tu, Jun Du, Chin-Hui Lee Journal of signal processing systems for signal, image, and video technology . 2018,第7期

机译：基于说话者的基于深度神经网络的单通道联合语音分离和声学建模方法，用于多语音对话的鲁棒识别
3. Towards Robust Indonesian Speech Recognition with Spontaneous-Speech Adapted Acoustic Models [J] . Devin Hoesen, Cil Hardianto Satriawan, Dessi Puji Lestari, Procedia Computer Science . 2016,第1期

机译：利用自发语音自适应声学模型实现鲁棒的印尼语音识别
4. Missing-feature-theory-based robust simultaneous speech recognition system with non-clean speech acoustic model [C] . Takahashi T., Nakadai K., Komatani K., IEEE/RSJ International Conference on Intelligent Robots and Systems;IROS 2009 . 2009

机译：基于缺失特征理论的鲁棒同时语音识别系统
5. Robust Acoustic Modeling and Front-End Design for Distant Speech Recognition [D] . Mirsamadi, Seyedmahdad. 2017

机译：鲁棒的声学建模和远端语音识别前端设计
6. Retrospective Analysis of Clinical Performance of an Estonian Speech Recognition System for Radiology: Effects of Different Acoustic and Language Models [O] . A. Paats, T. Alumäe, E. Meister, 2018

机译：一项爱沙尼亚放射线语音识别系统临床表现的回顾性分析：不同声学和语言模型的影响
7. Missing-Feature-Theory-Based Robust Simultaneous Speech Recognition System with Non-clean Speech Acoustic Model [O] . Toru Takahashi, Kazuhiro Nakadai, Kazunori Komatani, 2009

机译：基于缺失特征理论的鲁棒语音识别模型的鲁棒同时语音识别系统

Missing-Feature-Theory-based Robust Simultaneous Speech Recognition System with Non-clean Speech Acoustic Model

摘要

著录项

相似文献

相关主题

期刊订阅