Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Show Some Love to Your n-grams: A Bit of Progress and Stronger n-gram Language Modeling Baselines



Abstract

In recent years neural language models (LMs) have set state-of-the-art performance on several benchmark datasets. While the reasons for their success and their computational demands are well documented, a comparison between neural models and more recent developments in n-gram models has been neglected. In this paper, we examine the recent progress in the n-gram literature, running experiments on 50 languages covering all morphological language families. Experimental results illustrate that a simple extension of Modified Kneser-Ney outperforms an LSTM language model on 42 languages, while a word-level Bayesian n-gram LM (Shareghi et al., 2017) outperforms the character-aware neural model (Kim et al., 2016) on average across all languages, and its extension which explicitly injects linguistic knowledge (Gerz et al., 2018a) on 8 languages. Further experiments on larger Europarl datasets for 3 languages indicate that neural architectures are able to outperform the computationally much cheaper n-gram models: n-gram training is up to 15,000× quicker. Our experiments illustrate that standalone n-gram models lend themselves as natural choices for resource-lean or morphologically rich languages, while the recent progress has significantly improved their accuracy.
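The abstract's baseline builds on (Modified) Kneser-Ney smoothing. As a minimal illustration of the core idea — absolute discounting of observed bigram counts interpolated with a "continuation" unigram probability — here is a sketch of an interpolated Kneser-Ney bigram model. This is an assumption-laden toy (single fixed discount, bigram order only, no handling of the full Modified variant with multiple discounts), not the paper's implementation:

```python
from collections import Counter

def kneser_ney_bigram(tokens, discount=0.75):
    """Build a toy interpolated Kneser-Ney bigram model from a token list.

    Returns a function prob(word, context) estimating P(word | context).
    """
    bigrams = Counter(zip(tokens, tokens[1:]))
    history_counts = Counter(tokens[:-1])             # c(u): count of u as a bigram history
    continuation = Counter(w for (_, w) in bigrams)   # N1+(·, w): distinct left contexts of w
    followers = Counter(u for (u, _) in bigrams)      # N1+(u, ·): distinct continuations of u
    total_bigram_types = len(bigrams)                 # N1+(·, ·)

    def prob(word, context):
        c_uw = bigrams[(context, word)]
        c_u = history_counts[context]
        p_cont = continuation[word] / total_bigram_types
        if c_u == 0:
            return p_cont  # unseen history: fall back to continuation probability
        # Discounted bigram estimate, interpolated with the continuation unigram
        lam = discount * followers[context] / c_u
        return max(c_uw - discount, 0.0) / c_u + lam * p_cont

    return prob
```

The continuation probability is what distinguishes Kneser-Ney from plain absolute discounting: a word is rewarded for appearing after many *distinct* histories rather than for raw frequency. By construction, the distribution over a seen history sums to one.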

