首页> 外文学位 >Development of an approach to language identification based on language-dependent phone recognition.
【24h】

Development of an approach to language identification based on language-dependent phone recognition.

机译:基于语言相关的电话识别的语言识别方法的开发。

获取原文
获取原文并翻译 | 示例

摘要

The goal of Language Identification (LID) is to quickly and accurately identify the language being spoken. Although the differences among different (spoken) languages are generally large by any sensible measure, automatic language identification remains a major challenge (perhaps indicating the immaturity of the field of speech processing).; Current language identification systems vary greatly in terms of information utilization and system complexity. Understanding all of these approaches in a unified framework is one of the major challenges in automatic language identification. In this dissertation we provide a partial unification by studying the roles of acoustic, phonotactic and prosodic information in a particular system for language identification.; A comparative study was first conducted on a common two-language task (English and Japanese) to get a grasp of these issues. The results from the comparative experiments were used as basis for the development of a general purpose language-identification base-line system.; Within this framework, two novel LID information sources (backward language model and a context-dependent duration model) were introduced. These two models increased language modeling accuracy at a moderate cost in terms of training data. Also, a novel optimization method was introduced to enhance the discrimination between different languages. These methods led to substantial improvements in system performance. Preliminary studies into channel normalization, conversational speech and system adaptation to new languages were also pursued.; A general purpose LID software tool kit was developed based on the algorithm developed in this thesis work. The final LID system developed attained correct rates of 91% (45-second segments) and 77% (ten-second segments) on a commonly used nine-language task. This is one of the best results reported to date on these tasks.
机译:语言识别(LID)的目标是快速,准确地识别所讲的语言。尽管从任何明智的角度来看,不同(说)的语言之间的差异通常都很大,但是自动语言识别仍然是一个重大挑战(也许表明语音处理领域的不成熟)。当前的语言识别系统在信息利用和系统复杂性方面差异很大。在一个统一的框架中理解所有这些方法是自动语言识别的主要挑战之一。在本文中,我们通过研究声学,音韵和韵律信息在特定语言识别系统中的作用来提供部分统一。首先对一项常见的两种语言任务(英语和日语)进行了比较研究,以了解这些问题。比较实验的结果被用作开发通用语言识别基线系统的基础。在此框架内,引入了两个新颖的LID信息源(后向语言模型和上下文相关的持续时间模型)。根据训练数据,这两个模型以适当的成本提高了语言建模的准确性。此外,还引入了一种新颖的优化方法来增强不同语言之间的区别。这些方法大大提高了系统性能。还初步研究了渠道标准化,会话语音和系统对新语言的适应性。基于本文工作开发的算法,开发了通用的LID软件工具包。最终开发的LID系统在常用的九种语言任务中,分别达到91%(45秒的段)和77%(十秒的段)的正确率。这是迄今为止在这些任务上报告的最佳结果之一。

著录项

  • 作者

    Yan, Yonghong.;

  • 作者单位

    Oregon Graduate Institute of Science and Technology.;

  • 授予单位 Oregon Graduate Institute of Science and Technology.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 1995
  • 页码 102 p.
  • 总页数 102
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号