Augmented Context Features for Arabic Speech Recognition

机译：阿拉伯语语音识别的增强上下文特征

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We investigate different types of features for language modeling in Arabic automatic speech recognition. While much effort in language modeling research has been directed at designing better models or smoothing techniques for n-gram language models, in this paper we take the approach of augmenting the context in the n-gram model with different sources of information. We start by adding word class labels to the context. The word classes are automatically derived from un-annotated training data. As a contrast, we also experiment with POS tags which require a tagger trained on annotated data. An amalgam of these two methods uses class labels defined on word and POS tag combinations. Other context features include super-tags derived from the syntactic tree structure as well as semantic features derived from PropBank. Experiments on the DARPA GALE Arabic speech recognition task show that augmented context features often improve both perplexity and word error rate.

机译：我们研究阿拉伯自动语音识别中语言建模的不同类型的功能。尽管在语言建模研究上已进行了大量工作，旨在为n-gram语言模型设计更好的模型或平滑技术，但在本文中，我们采用了利用不同信息源来增强n-gram模型中上下文的方法。我们首先将单词类标签添加到上下文中。单词类别是从未注释的训练数据中自动得出的。相比之下，我们还尝试使用POS标签，该标签需要在带注释的数据上训练过的标记器。这两种方法的组合使用了在单词和POS标签组合上定义的类标签。其他上下文特征包括从语法树结构派生的超级标记以及从PropBank派生的语义特征。 DARPA GALE阿拉伯语语音识别任务的实验表明，增强的上下文功能通常可以提高困惑度和单词错误率。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2010》|2011年|p.1832-1835|共4页
会议地点
作者
Ahmad Emami; Hong-Kwang J. Kuo; Imed Zitouni; Lidia Mangu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
language modeling; speech recognition; clustering; syntactic features;

机译：语言建模;语音识别;集群句法特征;

相似文献

外文文献
中文文献
专利

1. Efficient Feature Extraction Algorithms to Develop an Arabic Speech Recognition System [J] . A. A. Alasadi, T. H. Aldhayni, R. R. Deshmukh, Engineering Technology and Applied Science Research . 2020,第2期

机译：高效的特征提取算法开发阿拉伯语语音识别系统
2. A Canonicalization of Distinctive Phonetic Features to Improve Arabic Speech Recognition [J] . Alotaibi Yousef A., Selouani Sidh-Amed, Yakoub Mohammed Sidi, Acta acustica united with acustica . 2019,第6期

机译：不同语音特征的规范化，提高阿拉伯语语音识别
3. Robust Arabic speech recognition in noisy environments using prosodic features and formant [J] . A.I. Amrous, M. Debyeche, A. Amrouche International journal of speech technology . 2011,第4期

机译：使用韵律特征和共振峰在嘈杂的环境中进行强大的阿拉伯语语音识别
4. Augmented Context Features for Arabic Speech Recognition [C] . Ahmad Emami, Hong-Kwang J. Kuo, Imed Zitouni, Annual conference of the International Speech Communication Association . 2010

机译：Arabic语音识别的增强语境特征
5. Arabic language modeling with stem-derived morphemes for automatic speech recognition. [D] . Heintz, Ilana. 2010

机译：具有词干衍生语素的阿拉伯语言建模，可实现自动语音识别。
6. Formant analysis in dysphonic patients and automatic Arabic digit speech recognition [O] . Ghulam Muhammad, Tamer A Mesallam, Khalid H Malki, 2011

机译：语音障碍患者的共振峰分析和阿拉伯数字自动语音识别
7. The effects of speakers' gender, age, and region on overall performance of Arabic automatic speech recognition systems using the phonetically rich and balanced Modern Standard Arabic speech corpus [O] . Sawalha M, Abu Shariah M 2013

机译：发言者的性别，年龄和地区对使用语音丰富和平衡的现代标准阿拉伯语言语料库的阿拉伯语自动语音识别系统整体表现的影响

Augmented Context Features for Arabic Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅