Long short-term memory language models with additive morphological features for automatic speech recognition

IEEE International Conference on Acoustics, Speech and Signal Processing


Abstract

Models of morphologically rich languages suffer from data sparsity when words are treated as atomic units. Word-based language models cannot transfer knowledge from common word forms to rarer variant forms. Learning a continuous vector representation of each morpheme allows a compositional model to represent a word as the sum of its constituent morphemes' vectors. Rare and unknown words containing common morphemes can thus be represented with greater fidelity despite their sparsity. Our novel neural network language model integrates this additive morphological representation into a long short-term memory architecture, improving Russian speech recognition word error rates by 0.9 absolute, 4.4% relative, compared to a robust n-gram baseline model.
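The core idea described in the abstract is straightforward to sketch: each word's input vector is the sum of the embeddings of its constituent morphemes, and that composed vector feeds an otherwise standard LSTM language model. The snippet below is a minimal illustration of this additive composition, not the authors' implementation; the class name AdditiveMorphLM, the tensor layout, and all hyperparameters are hypothetical, and PyTorch is assumed.

```python
import torch
import torch.nn as nn


class AdditiveMorphLM(nn.Module):
    """LSTM language model whose word vectors are sums of morpheme vectors."""

    def __init__(self, num_morphemes, embed_dim, hidden_dim, vocab_size, pad_id=0):
        super().__init__()
        # One embedding per morpheme; the padding index contributes a zero vector,
        # so words with fewer morphemes than the maximum are handled naturally.
        self.morph_embed = nn.Embedding(num_morphemes, embed_dim, padding_idx=pad_id)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, morpheme_ids):
        # morpheme_ids: (batch, seq_len, max_morphemes_per_word)
        # Additive composition: sum the morpheme vectors to get each word vector.
        word_vecs = self.morph_embed(morpheme_ids).sum(dim=2)  # (batch, seq_len, embed_dim)
        hidden, _ = self.lstm(word_vecs)                       # (batch, seq_len, hidden_dim)
        return self.out(hidden)                                # logits over the next word


# Toy usage: 2 sentences, 5 words each, at most 3 morphemes per word.
model = AdditiveMorphLM(num_morphemes=1000, embed_dim=64, hidden_dim=128, vocab_size=5000)
logits = model(torch.randint(1, 1000, (2, 5, 3)))
print(logits.shape)  # torch.Size([2, 5, 5000])
```

Because morpheme vectors are shared across word forms, a rare inflection of a common stem still receives an informative input vector, which is the effect the abstract credits for the word error rate reduction over the n-gram baseline.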
