首页> 外文学位 >Robust spoken document retrieval in multilingual and noisy acoustic environments.
【24h】

Robust spoken document retrieval in multilingual and noisy acoustic environments.

机译:在多语言和嘈杂的声学环境中进行可靠的语音文档检索。

获取原文
获取原文并翻译 | 示例

摘要

The focus of this thesis is rapidly and effectively increasing the capability and robustness of spoken information search technology in different languages and acoustic conditions. There are two primary thesis contributions that include distinct yet related areas.;The first thesis contribution addresses language leveraging from a speech retrieval point of view. More specifically, language leveraging algorithms are proposed to deploy spoken information search systems in new languages for which training resources are limited. The key idea is to employ knowledge from existing resource-rich language resources and the similarity of these resource-rich languages to a target language to improve spoken information search performance in the target language. Based on this key idea, multilingual parallel and hybrid system combination algorithms are proposed using phonetic lattice-based document and query representations. Experiments in a proper name retrieval task show that retrieval performance degradations (due to data sparseness during automatic speech recognition development in the target language) are compensated for by employing a phonetic recognition system from a resource-rich language. It is shown that the proposed algorithms for developing multilingual spoken information search technology in under-represented languages are able to achieve comparable retrieval performance using less training data. As a side contribution, a similar idea is also employed in a bilingual speaker recognition task where training and test data can be in a person's native language, L1, or in a second language L2. Again, the acoustic similarity between the language pairs are explored to effectively combine individual language-dependent speaker recognition systems in a parallel or hybrid fashion.;The second contribution focuses on the impact of acoustic condition change on retrieval performance in heterogeneous spoken audio collections. Proposed methods towards robust audio indexing and retrieval to reduce the acoustic mismatch employ an Environmental Sniffing module to organize data according to acoustic content, and to capture knowledge to adapt spoken document retrieval to changing acoustic conditions. Based on this key idea, robust parallel or hybrid system combination approaches are investigated using large vocabulary continuous speech recognition (LVCSR) based and sub-word based retrieval systems. Lattice-based vector space retrieval models are implemented using transducer indexes. This adaptive scheme yields significant improvement in terms of retrieval performance over traditional system combination methods.;Collectively, these contributions enable rapid transition of spoken document retrieval to new languages and acoustically heterogeneous audio collections.
机译:本文的重点是快速有效地提高语音信息搜索技术在不同语言和声学条件下的能力和鲁棒性。有两个主要的论文贡献,包括不同但又相关的领域。第一个论文贡献是从语音检索的角度解决语言利用问题。更具体地,提出了语言杠杆算法以训练资源受限的新语言来部署口语信息搜索系统。关键思想是利用来自现有资源丰富的语言资源的知识以及这些资源丰富的语言与目标语言的相似性,以提高目标语言中的语音信息搜索性能。基于这一关键思想,提出了基于语音格的文档和查询表示的多语言并行和混合系统组合算法。专有名称检索任务中的实验表明,通过使用来自资源丰富的语言的语音识别系统,可以弥补检索性能的下降(由于目标语言在自动语音识别开发过程中数据稀疏)。结果表明,所提出的用于以代表性不足的语言开发多语言口语信息搜索技术的算法能够使用较少的训练数据来实现可比的检索性能。作为附带的贡献,在双语说话者识别任务中也采用了类似的想法,其中训练和测试数据可以采用人的母语L1或第二语言L2。再次,探索了语言对之间的声学​​相似性,以有效地以并行或混合方式组合了各个依赖于语言的说话者识别系统。第二个贡献集中于声学条件变化对异构语音集合中检索性能的影响。针对鲁棒的音频索引和检索以减少声学失配的提议方法采用环境嗅探模块来根据声学内容组织数据,并捕获知识以使语音文档检索适应不断变化的声学条件。基于此关键思想,使用基于大词汇量连续语音识别(LVCSR)和基于子词的检索系统研究了鲁棒的并行或混合系统组合方法。基于格的向量空间检索模型是使用换能器索引实现的。与传统的系统组合方法相比,该自适应方案在检索性能方面产生了显着的改进。总之,这些贡献使语音文档检索迅速过渡到新语言和听觉上异类的音频集合。

著录项

  • 作者

    Akbacak, Murat.;

  • 作者单位

    University of Colorado at Boulder.;

  • 授予单位 University of Colorado at Boulder.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 156 p.
  • 总页数 156
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号