Multistream Recognition of Speech: Dealing With Unknown Unknowns

Hermansky H.

首页> 外文期刊>Proceedings of the IEEE >Multistream Recognition of Speech: Dealing With Unknown Unknowns

【24h】

Multistream Recognition of Speech: Dealing With Unknown Unknowns

机译：语音的多流识别：处理未知未知数

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The paper discusses an approach for dealing with unexpected acoustic elements in speech. The approach is motivated by observations of human performance on such problems, which indicate the existence of multiple parallel processing streams in the human speech processing cognitive system, combined with the human ability to know when the correct information is being received. Some earlier relevant engineering approaches in multistream automatic recognition of speech (ASR) that aimed at processing of noisy speech and at dealing with unexpected out-of-vocabulary words are reviewed. The paper also reviews some currently active research in multistream ASR, focusing mainly on feedback-based techniques involving fusion of information between individual processing streams. The difference between the system behavior on its training data and during its operation is proposed as a substitute for the human ability of “knowing when knowing.” Most recent results indicate 9% relative improvement in error rates in phoneme recognition of high signal-to-noise ratio speech and as high as 30% relative improvements in moderate noise.

机译：本文讨论了一种处理语音中意外的声学元素的方法。该方法是通过观察人类对此类问题的表现来激发的，这表明人类语音处理认知系统中存在多个并行处理流，并结合了人类知道何时接收到正确信息的能力。审查了一些早期的有关语音多流自动识别（ASR）的相关工程方法，该方法旨在处理嘈杂的语音并处理意外的语音外单词。本文还回顾了当前在多流ASR中的一些活跃研究，主要关注基于反馈的技术，这些技术涉及各个处理流之间的信息融合。有人建议将系统在其训练数据上的行为与运行期间的行为之间的差异替换为人类“知道时知道”的能力。最新的结果表明，在高信噪比语音的音素识别中，错误率的相对改善为9％，在中等噪声中的相对改善高达30％。

著录项

来源
《Proceedings of the IEEE》 |2013年第5期|1076-1088|共13页
作者
Hermansky H.;
展开▼
作者单位

Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD , USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Acoustic signal processing; Audio systems; Context awareness; Information processing; Noise measurement; Speech processing; Speech recognition; Auditory perception; confidence measures; machine learning; speech recognition; unexpected information;

机译：声音信号处理;音响系统;上下文意识;信息处理;噪声测量;语音处理;语音识别;听觉感知;信心措施;机器学习语音识别;意外信息;

相似文献

外文文献
中文文献
专利

1. Dealing with the unknown - addressing challenges in evaluating unintelligible speech [J] . Clinical linguistics & phonetics . 2020,第1a3期

机译：处理未知 - 解决挑战在评估易于语音方面
2. Trust Erosion: Dealing with Unknown-Unknowns in Cloud Security [J] . David A. Maluf, Raghuram S. Sudhaakar, Kim-Kwang Raymond Choo Cloud Computing, IEEE . 2018,第4期

机译：信任侵蚀：处理云安全中的未知未知
3. ORGANIZATIONAL CONDITIONS FOR DEALING WITH THE UNKNOWN UNKNOWN [J] . Catrien J.A.M. Termeer, Margo A. van den Brink Public management review . 2013,第1期

机译：处理未知的组织条件
4. Dealing with Unknown Unknowns: Identification and Selection of Minimal Sensing for Fractional Dynamics with Unknown Inputs [C] . Gaurav Gupta, Sérgio Pequito, Paul Bogdan 2018 Annual American Control Conference . 2018

机译：处理未知未知数：识别和选择具有未知输入的分数动态的最小感测
5. A Practical and Efficient Multistream Framework for Noise Robust Speech Recognition [D] . Mallidi, Sri Harish. 2018

机译：实用高效的多流噪声鲁棒语音识别框架
6. A Multistream Feature Framework Based on Bandpass Modulation Filtering for Robust Speech Recognition [O] . Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali -1

机译：在带通滤波调制多流功能根据框架鲁棒语音识别
7. Dealing with the unknown. A proposal for a method for redistributing skeletons of unknown sex and age in an assemblage [O] . Tony Waldron 2012

机译：处理未知事件。关于在一个集合中重新分配未知性别和年龄的骨骼的方法的提议

Multistream Recognition of Speech: Dealing With Unknown Unknowns

摘要

著录项

相似文献

相关主题

期刊订阅