Recognition of Speech from Live Sports Coverage Using Acoustic and Language Model Adaptation

Yasuo Ariki; Jim Ogata; Masakiyo Fujimoto; Kiyoshi Tsukada


Abstract

In this paper, we use large-vocabulary continuous speech recognition to detect highlight scenes in live radio coverage of baseball games and to extract keywords for indexing. To make the speech recognizer more robust, we perform maximum likelihood linear regression (MLLR) and maximum a posteriori (MAP) adaptation of the acoustic models, and we propose a two-level adaptation procedure that uses both supervised and unsupervised adaptation. Acoustic model adaptation lets us adapt to the announcer's speaking style and the recording environment, improving word accuracy by 28% over the baseline. For the language model, we performed adaptation via language model fusion, created classes for players' and commentators' names, and adjusted the pronunciation dictionary, improving word accuracy by 13% over the baseline. Combined adaptation of both models improved word accuracy by 38%.
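The abstract does not give the adaptation formulas, so the following is only a minimal sketch of the standard textbook MAP update for a single Gaussian mean, not the authors' procedure. The function name `map_adapt_mean`, the prior weight `tau`, and the toy inputs are all assumptions made for illustration.

```python
from typing import List, Sequence


def map_adapt_mean(
    prior_mean: Sequence[float],
    frames: Sequence[Sequence[float]],
    posteriors: Sequence[float],
    tau: float = 10.0,
) -> List[float]:
    """Textbook MAP re-estimation of one Gaussian mean (hypothetical helper).

    prior_mean : mean of the speaker-independent model
    frames     : adaptation feature vectors
    posteriors : occupation probability of this Gaussian for each frame
    tau        : assumed prior weight; larger values trust the prior more
    """
    dim = len(prior_mean)
    occupancy = sum(posteriors)  # total soft count for this Gaussian

    # Posterior-weighted sum of the adaptation frames.
    weighted_sum = [0.0] * dim
    for frame, gamma in zip(frames, posteriors):
        for d in range(dim):
            weighted_sum[d] += gamma * frame[d]

    # Interpolate between the prior mean and the adaptation data.
    return [
        (tau * prior_mean[d] + weighted_sum[d]) / (tau + occupancy)
        for d in range(dim)
    ]


if __name__ == "__main__":
    adapted = map_adapt_mean(
        prior_mean=[0.0, 0.0],
        frames=[[2.0, 4.0], [2.0, 4.0]],
        posteriors=[1.0, 1.0],
        tau=2.0,
    )
    print(adapted)
```

With little adaptation data the result stays close to the prior mean, and with more data it moves toward the data mean. This is why MAP is often paired with a transform-based method such as MLLR when adaptation data is scarce.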


Bibliographic details

Source
Systems and Computers in Japan | 2005, No. 8 | 9 pages
Authors
Yasuo Ariki; Jim Ogata; Masakiyo Fujimoto; Kiyoshi Tsukada
Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology
Keywords
Live broadcast speech; Acoustic model adaptation; Language model adaptation; Speech rate; Language model fusion;


Similar literature


1. Recognition of Speech from Live Sports Coverage Using Acoustic and Language Model Adaptation [J]. Yasuo Ariki, Jim Ogata, Masakiyo Fujimoto, Systems and Computers in Japan. 2005, No. 8
2. Sports live speech recognition using acoustic and language model adaptation - application to highlight scene detection [J]. Takeru Shigemori, Tsuyoshi Kaneko, Jun Ogata, 電子情報通信学会技術研究報告. 音声. Speech. 2002, No. 618
3. Live Speech Recognition in Sports Games by Adaptation of Acoustic Model and Language Model [C]. Yasuo Ariki, Takeru Shigemori, Tsuyoshi Kaneko, European Conference on Speech Communication and Technology. 2003
4. Acoustic model and adaptation for automatic speech recognition and animal vocalization classification [D]. Tao, Jidong. 2009
5. Retrospective Analysis of Clinical Performance of an Estonian Speech Recognition System for Radiology: Effects of Different Acoustic and Language Models [O]. A. Paats, T. Alumäe, E. Meister, 2018
6. End-to-End Speech Endpoint Detection Utilizing Acoustic and Language Modeling Knowledge for Online Low-Latency Speech Recognition [O]. Inyoung Hwang, Joon-Hyuk Chang, 2020
