Improvements in Japanese Voice Search

Abstract

This paper describes work on Japanese voice search at Yahoo! Japan. We first describe several implementation details of our internal decoder, which is based on weighted finite-state transducers (WFSTs), that make the voice-search task more efficient, including a simple but effective compressed WFST arc representation. This representation permits a decoder process that uses approximately 2 GB of memory for a 1-million-word vocabulary and a 35-million N-gram language model. We then describe our baseline system built on this decoder and compare it against two open-source decoders, Juicer and Julius. We also describe our initial attempts to adapt the baseline system through simple language model adaptation using manually transcribed, anonymized voice queries. To achieve this, we present a sequence of WFST operations that preserves consistency of segmentation between manual and automatic transcriptions. We show that even this simple adaptation method yields a relative reduction of up to 4.6% in sentence error rate and 8.2% in character error rate.
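
The two metrics quoted above can be illustrated with a minimal sketch. It assumes the usual definitions: sentence error rate is the fraction of utterances whose hypothesis differs from the reference, and character error rate is the total character-level edit distance divided by the total number of reference characters. The function names and the toy queries are hypothetical and are not taken from the paper; the paper's own scoring procedure may differ.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitution, insertion, deletion cost 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def character_error_rate(refs, hyps):
    """Total character edit distance divided by total reference characters."""
    errors = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total = sum(len(r) for r in refs)
    return errors / total


def sentence_error_rate(refs, hyps):
    """Fraction of utterances whose hypothesis is not an exact match."""
    wrong = sum(1 for r, h in zip(refs, hyps) if r != h)
    return wrong / len(refs)


def relative_reduction(baseline, adapted):
    """Relative reduction of an error rate with respect to the baseline."""
    return (baseline - adapted) / baseline


# Toy data (hypothetical, not from the paper): two queries, one misrecognized character.
refs = ["天気予報", "東京駅"]
hyps = ["天気予報", "東京液"]
cer = character_error_rate(refs, hyps)
ser = sentence_error_rate(refs, hyps)
```

A relative reduction is measured against the baseline rather than in absolute points: under these definitions, a drop from a 25.0% to a 23.85% error rate is a 4.6% relative reduction.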

Bibliographic details

Source
INTERSPEECH 2012 | 2012 | 4 pages
Authors
Ken-ichi Iso; Edward Whittaker; Tadashi Emori; Jumpei Miyake
Original format: PDF
Chinese Library Classification: 73.4136083
Keywords
ASR; Japanese; voice search; WFST

Similar literature

1. Japanese Question-Answering System Using A* Search and Its Improvement [J]. Tatsunori Mori. ACM Transactions on Asian Language Information Processing, 2005, No. 3.
2. Psychophysical Boundary for Categorization of Voiced-Voiceless Stop Consonants in Native Japanese Speakers [J]. Tamura Shunsuke, Ito Kazuhito, Hirose Nobuyuki. Journal of Speech, Language, and Hearing Research (JSLHR), 2018, No. 3.
3. Similarity Search of Acted Voices for Automatic Voice Casting [J]. Nicolas Obin, Axel Roebel. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2016, No. 9.
4. Improvements in Japanese Voice Search [C]. Ken-ichi Iso, Edward Whittaker, Tadashi Emori, Jumpei Miyake. Annual Conference of the International Speech Communication Association, 2012.
5. Experiences of Japanese civilians living near a United States military base: Understanding United States-Japanese relations through the voices of citizens [D]. Gelder, Melinda R. 2005.
6. Voice onset time is necessary but not always sufficient to describe acquisition of voiced stops: The cases of Greek and Japanese [O]. Eun Jong Kong, Mary E. Beckman, Jan Edwards.
7. Japanese Language Learners' Listening Ability of Voiced-Unvoiced Sounds and Special Mora of the Japanese Dialects [O]. 堀口純子 (Horiguchi Sumiko). 1993.