Conference: 2012 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference

Expansion of training texts to generate a topic-dependent language model for meeting speech recognition



Abstract

This paper proposes expansion methods for the training texts (baseline) used to generate a topic-dependent language model for more accurate recognition of meeting speech. Preparing a universal language model that can cope with the variety of topics discussed in meetings is very difficult. Our strategy is to generate topic-dependent training texts with two methods. The first is text collection from web pages using queries composed of topic-dependent confident terms; these terms are selected from preparatory recognition results based on the TF-IDF (Term Frequency, Inverse Document Frequency) value of each term. The second is text generation using participants' names. Our topic-dependent language model was generated from these new texts together with the baseline corpus. Compared with the language model trained only on the baseline corpus, the language model generated by the proposed strategy reduced the perplexity by 16.4% and the out-of-vocabulary rate by 37.5%. This improvement was also confirmed through meeting speech recognition.
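The first expansion method hinges on ranking terms from the preparatory recognition results by TF-IDF and using the top-ranked "confident terms" as web-search query keywords. The following Python snippet is a minimal illustrative sketch of that selection step, not the authors' implementation: the function name, the smoothed IDF variant, the max-over-segments scoring, and the top-k cutoff are all assumptions for illustration.

```python
# Illustrative sketch (not the paper's code) of TF-IDF-based selection of
# topic-dependent confident terms from preparatory recognition results.
import math
from collections import Counter

def tf_idf_terms(recognized_docs, top_k=10):
    """Rank terms in preparatory recognition output by TF-IDF.

    recognized_docs: list of token lists, one per recognized meeting segment.
    Returns the top_k terms to be used as web-search query keywords.
    """
    n_docs = len(recognized_docs)
    # Document frequency: number of segments in which each term appears.
    df = Counter()
    for doc in recognized_docs:
        df.update(set(doc))

    scores = {}
    for doc in recognized_docs:
        tf = Counter(doc)
        for term, count in tf.items():
            # Standard smoothed IDF; the paper may use a different variant.
            idf = math.log(n_docs / df[term]) + 1.0
            # Keep each term's best score across segments (an assumption).
            scores[term] = max(scores.get(term, 0.0), (count / len(doc)) * idf)

    return [t for t, _ in sorted(scores.items(), key=lambda x: -x[1])[:top_k]]

if __name__ == "__main__":
    # Toy segments standing in for preparatory recognition output.
    segments = [
        "speech recognition meeting language model".split(),
        "topic dependent language model training".split(),
        "meeting participants discussion agenda".split(),
    ]
    print(tf_idf_terms(segments, top_k=5))
```

The selected terms would then be combined into web-search queries, and the retrieved pages would serve as topic-dependent training text to be interpolated with the baseline corpus.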


