Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System

机译：会议认可的进一步进展：ICSI-SRI Spring 2005演讲到文本评估系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe the development of our speech recognition system for the National Institute of Standards and Technology (NIST) Spring 2005 Meeting Rich Transcription (RT-05S) evaluation, highlighting improvements made since last year [1]. The system is based on the SRI-ICSI-UW RT-04F conversational telephone speech (CTS) recognition system, with meeting-adapted models and various audio preprocessing steps. This year’s system features better delay-sum processing of distant microphone channels and energy-based crosstalk suppression for close-talking microphones. Acoustic modeling is improved by virtue of various enhancements to the background (CTS) models, including added training data, decision-tree based state tying, and the inclusion of discriminatively trained phone posterior features estimated by multilayer perceptrons. In particular, we make use of adaptation of both acoustic models and MLP features to the meeting domain. For distant microphone recognition we obtained considerable gains by combining and cross-adapting narrow-band (telephone) acoustic models with broadband (broadcast news) models. Language models (LMs) were improved with the inclusion of new meeting and web data. In spite of a lack of training data, we created effective LMs for the CHIL lecture domain. Results are reported on RT-04S and RT-05S meeting data. Measured on RT-04S conference data, we achieved an overall improvement of 17% relative in both MDM and IHM conditions compared to last year’s evaluation system. Results on lecture data are comparable to the best reported results for that task.

机译：我们描述了我们为国家标准和技术研究所（NIST）春季2005年富人转录（RT-05S）评估的发展，突出了自去年以来的改进[1]。该系统基于SRI-ICSI-UW RT-04F会话电话语音（CTS）识别系统，具有满足适应的模型和各种音频预处理步骤。今年的系统采用近距离麦克风通道和基于能量的串扰抑制的更好的延迟处理，用于近距离谈话的麦克风。通过对背景（CTS）模型的各种增强，包括添加训练数据，基于决策树的状态捆绑的各种增强，以及包含多层训练的训练的电话后续特征，改善了声学建模。特别是，我们利用声学模型和MLP功能的调整到会议域。对于远处的麦克风识别，我们通过将具有宽带（广播新闻）模型的窄带（电话）声学模型组合和交叉调整窄带（电话）声学模型来获得相当大的增益。在包含新的会议和Web数据的情况下，改进了语言模型（LMS）。尽管缺乏培训数据，但我们为Chil讲座域创建了有效的LMS。结果在RT-04S和RT-05S会议数据上报告。与去年的评估系统相比，在RT-04S会议数据上测量，我们在MDM和IHM条件下实现了17％的总体提高。讲座数据的结果与该任务的最佳报告结果相当。

著录项

来源
《International Workshop on Machine Learning for Multimodal Interaction》|2006年||共13页
会议地点
作者
Andreas Stolcke; Xavier Anguera; Kofi Boakye; Ozgür Cetin; Frantisek Grézl; Adam Janin; Arindam Mandal; Barbara Peskin; Chuck Wooters; Jing Zheng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Meeting Assistant System Berbasis Teknologi Speech-to-Text [J] . Daniel Soesanto, Budi Hartanto, Melisa Teknika . 2021,第1期

机译：会议助理系统Berbasis Teknologi演讲到文本
2. 2005 E-MRS Spring Meeting Examined Broad Spectrum of Materials Science [J] . JOHN BLIZZARD MRS bulletin . 2005,第11期

机译：2005年E-MRS春季会议审查了广泛的材料科学
3. 2005 MRS Spring Meeting Mixes the Aesthetics and Science of Materials Research [J] . MRS bulletin . 2005,第6期

机译：2005 MRS春季会议融合了材料研究的美学和科学
4. Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System [C] . Andreas Stolcke, Xavier Anguera, Kofi Boakye, International Workshop on Machine Learning for Multimodal Interaction . 2006

机译：会议认可的进一步进展：ICSI-SRI Spring 2005演讲到文本评估系统
5. Advances in hydrocarbon synthesis: 1. Progress in the living polyhomologation reaction: New monomers, catalysts and architectures 2. Progress in synthesis and evaluation of potential nestmate recognition cues of the Argentine Ant (Linepithema humile). [D] . Sulc, Robert. 2008

机译：碳氢化合物合成方面的进展：1.活性多同源反应的进展：新的单体，催化剂和结构2.阿根廷蚂蚁（Linepithema humile）的潜在巢式识别线索的合成和评估进展。
6. A focus on cross-purpose tools automated recognition of study design in multiple disciplines and evaluation of automation tools: a summary of significant discussions at the fourth meeting of the International Collaboration for Automation of Systematic Reviews (ICASR) [O] . Annette M. O’Connor, Paul Glasziou, Michele Taylor, 2020

机译：着眼于跨用途工具对多学科研究设计的自动识别以及对自动化工具的评估：国际系统评价自动化协作组织（ICASR）第四次会议上的重要讨论摘要
7. Further progress in meeting recognition: The ICSI-SRI Spring 2005 speech-to-text evaluation system [O] . Andreas Stolcke, Xavier Anguera, Kofi Boakye, 2005

机译：会议认可的进一步进展：ICsI-sRI 2005春季语音文本评估系统
8. Advanced Language Recognition using Cepstra and Phonotactics: MITLL System Performance on the NIST 2005 Language Recognition Evaluation. [R] . Campbell, W. M., Gleason, T., Navratil, J., 2016

机译：使用Cepstra和phonotactics进行高级语言识别：NIsT 2005语言识别评估中的mITLL系统性能。

Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System

摘要

著录项

相似文献

相关主题

期刊订阅