首页> 外文期刊>IEICE Transactions on Information and Systems >Adaptation to Pronunciation Variations in Indonesian Spoken Query-Based Information Retrieval
【24h】

Adaptation to Pronunciation Variations in Indonesian Spoken Query-Based Information Retrieval

机译:印尼口语基于查询的信息检索中对语音变体的适应

获取原文
获取原文并翻译 | 示例
       

摘要

Recognition errors of proper nouns and foreign words significantly decrease the performance of ASR-based speech applications such as voice dialing systems, speech summarization, spoken document retrieval, and spoken query-based information retrieval (IR). The reason is that proper nouns and words that come from other languages are usually the most important key words. The loss of such words due to misrecog-nition in turn leads to a loss of significant information from the speech source. This paper focuses on how to improve the performance of Indonesian ASR by alleviating the problem of pronunciation variation of proper nouns and foreign words (English words in particular). To improve the proper noun recognition accuracy, proper-noun specific acoustic models are created by supervised adaptation using maximum likelihood linear regression (MLLR). To improve English word recognition, the pronunciation of English words contained in the lexicon is fixed by using rule-based English-to-Indonesian phoneme mapping. The effectiveness of the proposed method was confirmed through spoken query based Indonesian IR. We used Inference Network-based (IN-based) IR and compared its results with those of the classical Vector Space Model (VSM) IR, both using a tf-idf weighting schema. Experimental results show that IN-based IR outperforms VSM IR.
机译:专有名词和外来词的识别错误会大大降低基于ASR的语音应用程序的性能,例如语音拨号系统,语音摘要,语音文档检索以及基于语音查询的信息检索(IR)。原因是来自其他语言的专有名词和单词通常是最重要的关键字。由于误认导致的此类单词的丢失进而导致语音源中大量信息的丢失。本文着重探讨如何通过缓解专有名词和外来词(尤其是英语单词)的发音变化问题来提高印尼ASR的性能。为了提高专有名词识别的准确性,通过使用最大似然线性回归(MLLR)进行有监督的自适应来创建专有名词特定的声学模型。为了提高英语单词的识别能力,使用基于规则的英语到印尼语音素映射来固定词典中包含的英语单词的发音。通过基于口语查询的印尼语IR证实了该方法的有效性。我们使用了基于推理网络(基于IN)的IR,并将其结果与经典向量空间模型(VSM)IR的结果进行了比较,均使用tf-idf加权方案。实验结果表明,基于IN的IR优于VSM IR。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号