Incremental Language Modeling for Automatic Transcription of Broadcast News

Katsutoshi OHTSUKI; Long NGUYEN

首页> 外文期刊>IEICE Transactions on Information and Systems >Incremental Language Modeling for Automatic Transcription of Broadcast News

【24h】

Incremental Language Modeling for Automatic Transcription of Broadcast News

机译：广播新闻自动转录的增量语言建模

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we address the task of incremental language modeling for automatic transcription of broadcast news speech. Daily broadcast news naturally contains new words that are not in the lexicon of the speech recognition system but are important for downstream applications such as information retrieval or machine translation. To recognize those new words, the lexicon and the language model of the speech recog-nition system need to be updated periodically. We propose a method of estimating a list of words to be added to the lexicon based on some time-series text data. The experimental results on the RT04 Broadcast News data and other TV audio data showed that this method provided an impressive and stable reduction in both out-of-vocabulary rates and speech recognition word error rates.

机译：在本文中，我们解决了用于广播新闻语音自动转录的增量语言建模的任务。每日广播新闻自然包含新词，这些词不在语音识别系统的词典中，但对于下游应用程序（如信息检索或机器翻译）很重要。为了识别这些新单词，语音识别系统的词典和语言模型需要定期更新。我们提出了一种基于一些时间序列文本数据来估计要添加到词典中的单词列表的方法。在RT04广播新闻数据和其他电视音频数据上的实验结果表明，该方法可显着稳定地降低出语音率和语音识别单词错误率。

著录项

来源
《IEICE Transactions on Information and Systems》 |2007年第2期|p.526-532|共7页
作者
Katsutoshi OHTSUKI; Long NGUYEN;
展开▼
作者单位

NTT Cyber Space Laboratories, NTT Corporation, Yokosuka-shi, 239-0847 Japan;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
speech recognition; out-of-vocabulary; language model; broadcast news;

机译：语音识别;词汇外;语言模型;广播新闻;
入库时间 2022-08-18 00:28:21

相似文献

外文文献
中文文献
专利

1. Unsupervised stemmed text corpus for language modeling and transcription of Telugu broadcast news [J] . Mythilisharan Pala, Laxminarayana Parayitam, Venkataramana Appala International journal of speech technology . 2020,第3期

机译：无监督的威胁文本语料库语言建模和泰卢语播放新闻的转录
2. Modeling of Slovak Language for Broadcast News Transcription [J] . Journal of Electrical and Electronics Engineering . 2015,第2期

机译：广播新闻转录的斯洛伐克语语言建模
3. A New Language Model Using (n > 4)-gram for Broadcast News Speech Transcription [J] . Naoto Katoh, Noriyoshi Uratani, Terumasa Ehara, Systems and Computers in Japan . 2005,第1期

机译：使用（n> 4）-gram进行广播新闻语音转录的新语言模型
4. DYNAMIC LANGUAGE MODELING FOR A DAILY BROADCAST NEWS TRANSCRIPTION SYSTEM [C] . Ciro Martins, Antonio Teixeira, Joao Neto IEEE Workshop on Automatic Speech Recognition and Understanding . 2007

机译：日常广播新闻转录系统的动态语言建模
5. Automatic Broadcast News speech summarization. [D] . Maskey, Sameer Raj. 2008

机译：自动广播新闻语音摘要。
6. A Reinforcement Learning Model Equipped with Sensors for Generating Perception Patterns: Implementation of a Simulated Air Navigation System Using ADS-B (Automatic Dependent Surveillance-Broadcast) Technology [O] . Santiago Álvarez de Toledo, Aurea Anguera, José M. Barreiro, 2017

机译：配备用于生成感知模式的传感器的强化学习模型：使用ADS-B（自动相关监视-广播）技术的模拟空中导航系统的实现
7. Improved Modeling and Efficiency for Automatic Transcription of Broadcast News [O] . Ananth Sankar, Venkata Ramana Rao Gadde, Andreas Stolcke, 2001

机译：广播新闻自动转录的改进模型和效率

Incremental Language Modeling for Automatic Transcription of Broadcast News

摘要

著录项

相似文献

相关主题

期刊订阅