首页> 外文学位 >Learning sub-word units and exploiting contextual information for open vocabulary speech recognition.
【24h】

Learning sub-word units and exploiting contextual information for open vocabulary speech recognition.

机译:学习子词单位并利用上下文信息进行开放式词汇语音识别。

获取原文
获取原文并翻译 | 示例

摘要

Large vocabulary continuous speech recognition (LVCSR) systems fail to recognized words beyond their vocabulary, many of which are information rich terms such as named entities, technical terms, or foreign words. Mis-recognizing these Out-of-Vocabulary (OOV) words can have a disproportionate impact in transcript coherence, and cause recognition failures which propagate through pipeline systems, impacting the performance of downstream applications. Ideally, a speech recognition system would be able to recognize arbitrary, even previously unseen, words.;This dissertation presents an approach to recover from failures caused by OOVs by automatically identifying when OOVs are spoken and transcribing them using sub-lexical units. This results in a hybrid word/sub-word system which predicts full-words for in-vocabulary terms and sub-lexical units for OOVs. We first present an approach to model OOVs using sub-lexical units automatically learned from data. The learned units are variable-length phone sequences, which are included in the recognizer's vocabulary and language model. Previous work heuristically creates the sub-word lexicon from phonetic representations of text using simple statistics to select common phone sequences. Instead, we propose a novel unsupervised approach to learn the sub-word lexicon optimized for a given task. This approach employs a log-linear model with overlapping features to learn multi-phone units obtained by segmenting the phonetic representation of a corpus.;OOV Detection is the task of identifying regions in the recognizer's output where out-of-vocabulary words were uttered. The detection of OOV regions is helpful to avoid error propagation to downstream applications such as machine translation, named entity recognition, and spoken document retrieval. We combine the proposed hybrid system with confidence based metrics to improve OOV detection performance. Previous work address OOV detection as a binary classification task, where each region is independently classified using local information. This dissertation treats this problem as a sequence labeling problem, and shows that (1) jointly predicting out-of-vocabulary regions, (2) including contextual information from each region, and (3) learning sub-lexical units optimized for this task, leads to substantial improvements with respect to state-of-the-an systems.;The resulting sub-word representation and OOV detector is helpful to recover the correct spelling of new words, resulting in an open-vocabulary system; and improves performance in downstream applications strongly affected by out-of-vocabulary terms, such as: spoken term detection and named entity recognition in speech.
机译:大型词汇连续语音识别(LVCSR)系统无法识别超出其词汇范围的单词,其中许多都是信息丰富的术语,例如命名实体,技术术语或外来词。错误地识别这些词外(OOV)词可能会对成绩单连贯性产生不成比例的影响,并导致识别失败,这些失败会通过管道系统传播,从而影响下游应用程序的性能。理想情况下,语音识别系统将能够识别任意的,甚至以前看不见的单词。本论文提出了一种通过自动识别何时说出OOV并使用亚词汇单位转录OOV来从OOV造成的故障中恢复的方法。这导致了混合词/子词系统,该系统可预测词汇中的完整词和OOV的子词法单位。我们首先提出一种使用从数据中自动学习的亚词汇单位对OOV建模的方法。学习的单位是可变长度的电话序列,这些序列包含在识别器的词汇和语言模型中。以前的工作使用简单的统计信息从文本的语音表示中试探性地创建子词词典,以选择常见的电话序列。相反,我们提出了一种新颖的无监督方法来学习针对给定任务优化的子词词典。该方法采用具有重叠特征的对数线性模型来学习通过分割语料库的语音表示而获得的多电话单元。OOV检测是识别识别器输出中发声过的单词的区域的任务。 OOV区域的检测有助于避免错误传播到下游应用程序,例如机器翻译,命名实体识别和语音文档检索。我们将提出的混合系统与基于置信度的指标结合起来,以提高OOV检测性能。先前的工作将OOV检测作为二进制分类任务,其中使用本地信息对每个区域进行独立分类。本文将这个问题视为序列标记问题,并表明(1)共同预测语音区以外的区域,(2)包括每个区域的上下文信息,以及(3)学习为此任务优化的亚词法单元,所产生的子词表示和OOV检测器有助于恢复新词的正确拼写,从而形成开放词汇系统;并提高了受词汇量严重影响的下游应用程序的性能,例如:语音术语检测和语音中的命名实体识别。

著录项

  • 作者

    Parada, Maria Carolina.;

  • 作者单位

    The Johns Hopkins University.;

  • 授予单位 The Johns Hopkins University.;
  • 学科 Engineering Electronics and Electrical.;Computer Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 181 p.
  • 总页数 181
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号