Towards long-range prosodic attribute modeling for language recognition

机译：走向用于语言识别的远程韵律属性建模

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

As a high-level feature, prosody may be an effective feature when it is modeled over longer ranges than the typical range of a syllable. This paper is about language recognition with the high-level prosodic attributes. It studies two important issues of long-range modeling, namely the data scarcity handling method, and the model which properly describes prosodic boundary events. Illustrated by NIST language recognition evaluation (LRE) 2009, long-range modeling is shown to bring a 7.2% relative improvement to a prosodic language detector. Score fusion between the long-range prosodic system and a phonotactic system gives an EER of 3.07%. Exploiting boundary N-grams is the main contributing factor to global EER reduction, while different long-range prosodic modeling factors benefit the detection of different languages. Analysis reveals the evidence of language-specific long-range prosodic attributes, which sheds light on robust long-range modeling methods for language recognition.

机译：作为高级功能，韵律在比音节的典型范围更长的范围内建模时可能是有效的功能。本文是关于具有高级韵律属性的语言识别的。它研究了远程建模的两个重要问题，即数据稀缺性处理方法和正确描述韵律边界事件的模型。由NIST语言识别评估（LRE）2009说明，远程建模显示给韵律语言检测器带来7.2％的相对改进。远程韵律系统和音律系统之间的分数融合得出的EER为3.07％。利用边界N-gram是全局EER减少的主要促成因素，而不同的远程韵律建模因素则有利于检测不同的语言。分析揭示了特定于语言的远程韵律属性的证据，这为用于语言识别的强大的远程建模方法提供了启示。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2010》|2011年|p.1792-1795|共4页
会议地点
作者
Raymond W. M. Ng; Cheung-Chi Leung; Ville Hautamdki; Tan Lee; Bin Ma; Haizhou Li;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
language recognition; prosody; long-range modeling;

机译：语言识别;韵律远程建模;

相似文献

外文文献
中文文献
专利

1. Unsupervised Adaptation of Categorical Prosody Models for Prosody Labeling and Speech Recognition [J] . Ananthakrishnan S., Narayanan S. IEEE transactions on audio, speech and language processing . 2009,第1期

机译：类别韵律模型的无监督适应，用于韵律标记和语音识别
2. Application of prosody models for developing speech systems in Indian languages [J] . K. Sreenivasa Rao International journal of speech technology . 2011,第1期

机译：韵律模型在印度语言开发中的应用
3. Universal attribute characterization of spoken languages for automatic spoken language recognition [J] . Sabato Marco Siniscalchi, Jeremy Reed, Torbjorn Svendsen, Computer speech and language . 2013,第1期

机译：口语的通用属性表征，用于自动口语识别
4. Towards long-range prosodic attribute modeling for language recognition [C] . Raymond W. M. Ng, Cheung-Chi Leung, Ville Hautamdki, Annual conference of the International Speech Communication Association . 2010

机译：朝向语言识别的远程韵律属性建模
5. Modeling prosodic differences for speaker and language recognition. [D] . Adami, Andre Gustavo. 2004

机译：为说话者和语言识别建模韵律差异。
6. Unsupervised Adaptation of Categorical Prosody Models for Prosody Labeling and Speech Recognition [O] . Sankaranarayanan Ananthakrishnan, Shrikanth Narayanan -1

机译：类别韵律模型的无监督适应用于韵律标记和语音识别
7. PROSODY MODELING AND EIGEN-PROSODY ANALYSIS FOR ROBUST SPEAKER RECOGNITION [O] . Zi-he Chen, Yuan-fu Liao, Yau-tarng Juang 2009

机译：健壮说话人识别的模型建模与特征本构分析

Towards long-range prosodic attribute modeling for language recognition

摘要

著录项

相似文献

相关主题

期刊订阅