Developing a method to build Japanese speech recognition system based on 3-gram language model expansion with Google database

机译：通过Google数据库开发基于3克语言模型扩展构建日语语音识别系统的方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We have developed a method to build a Japanese automatic speech recognition (ASR) system based on 3-gram language model expansion with the Google database. Our aim is to enhance the recognition accuracy of ASR systems based on the 3-gram language model, even in cases where the language model is trained using short text segments. We investigate a practical approach to expanding language models by using 3-gram information from external web documents. In addition, we filter 3-gram entries on the basis of term frequency-inverse document frequency (TF-IDF) scores and the output of the Yahoo! web API to prevent the unnecessary addition of redundant or irrelevant 3-gram entries. In the experiments, we achieved an improvement of 0.71% in the word error rate and proved that the recognition accuracy can be improved by combining the proposed method and the traditional back-off smoothing technique without any costs being incurred in collecting additional text for training the model.

机译：我们开发了一种基于3克语言模型扩展的日本自动语音识别（ASR）系统的方法，使用Google数据库构建了3克语言模型。我们的目的是提高基于3克语言模型的ASR系统的识别准确性，即使在使用短文本段训练语言模型的情况下也是如此。我们调查通过使用外部Web文档的3克信息来扩展语言模型的实用方法。此外，我们基于术语频率 - 逆文档频率（TF-IDF）分数和雅虎的输出来过滤3克条目。 Web API可防止不必要地添加冗余或无关的3克条目。在实验中，我们以字错误率实现了0.71％的提高，并证明了通过组合所提出的方法和传统的退避平滑技术，可以提高识别准确度，而不会在收集其他文本以进行培训模型。

著录项

来源
《IEEE International Conference on Computer-Aided Industrial Design Conceptual Design》|2014年||共6页
会议地点
作者
Shimada Toshiaki; Nisimura Ryuichi; Tanaka Masayasu; Kawahara Hideki;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
Google database; Language modeling; Speech recognition system;

机译：谷歌数据库;语言建模;语音识别系统;

相似文献

外文文献
中文文献
专利

1. Comparison of Performance of Enhanced Morpheme-based Language Model with Different Word-based Language Models for Improving the Performance of Tamil Speech Recognition System [J] . S. SARASWATHI, T.V. GEETHA ACM transactions on Asian language information processing . 2007,第3期

机译：增强的基于词素的语言模型与不同的基于单词的语言模型的性能比较，以提高泰米尔语语音识别系统的性能
2. Investigation of Automatic Speech Recognition Systems via the Multilingual Deep Neural Network Modeling Methods for a Very Low-Resource Language, Chaha [J] . Tessfu Geteye Fantaye, Junqing Yu, Tulu Tilahun Hailu Journal of Signal and Information Processing . 2020,第1期

机译：Chaha非常低于资源语言的多语言深神经网络建模方法对自动语音识别系统的研究
3. Investigation of Automatic Speech Recognition Systems via the Multilingual Deep Neural Network Modeling Methods for a Very Low-Resource Language, Chaha [J] . Tessfu Geteye Fantaye, Junqing Yu, Tulu Tilahun Hailu 信号与信息处理（英文） . 2020,第001期

机译：资源非常少的语言Chaha通过多语言深层神经网络建模方法研究自动语音识别系统
4. Developing a method to build Japanese speech recognition system based on 3-gram language model expansion with Google database [C] . Shimada Toshiaki, Nisimura Ryuichi, Tanaka Masayasu, IEEE International Conference on Computer-Aided Industrial Design Conceptual Design . 2014

机译：通过Google数据库开发基于3克语言模型扩展构建日语语音识别系统的方法
5. Giving speech a hand: fMRI of co-speech beat gesture processing in adult native English speakers, Japanese English as a second language speakers, typically-developing children, and children with autism spectrum disorder [D] . Hubbard, Amy L. 2009

机译：提供帮助：成人成年英语母语者，日语作为第二语言讲者，典型发育中的儿童以及患有自闭症谱系障碍儿童的共语音节拍手势处理的功能磁共振成像
6. Knowledge-Based Systems. Methods for Developing and Evaluating Expert Systems: A Language/Action Model of Human-Computer Communication in a Psychiatric Hospital [O] . R. A. Morelli, J. W. Goethe, J. D. Bronzino 1990

机译：基于知识的系统。开发和评估专家系统的方法：精神病医院中人机交流的语言/行为模型
7. A New Word Clustering Method for Building N-Gram Language Models in Continuous Speech Recognition Systems [O] . Mohammad Bahrani, Hossein Sameti, Nazila Hafezi, 2013

机译：连续语音识别系统中构建N-gram语言模型的新词聚类方法

Developing a method to build Japanese speech recognition system based on 3-gram language model expansion with Google database

摘要

著录项

相似文献

相关主题

期刊订阅