The main purpose of this work is to explore the use of attention-based recurrent neural networks for text language identification. The most common, statistical language identification approaches are effective but require long texts to perform well. To address this problem, we propose a neural model based on a Long Short-Term Memory network augmented with an attention mechanism. The evaluation of the proposed method includes tests on texts written in disparate styles and tests on a corpus of Twitter posts, which comprises short and noisy texts. As a baseline, we apply a widely used statistical method based on the frequency of n-gram occurrences. Additionally, we investigate the impact of the attention mechanism by comparing the results with those of the same model without attention. The proposed model outperforms the baseline, achieving 97.98% accuracy on a test corpus covering 36 languages, and largely retains this performance on the Twitter corpus, reaching 91.6% accuracy.
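For context, the statistical baseline family referred to above can be illustrated with a minimal sketch of the classic character n-gram rank-profile approach (in the style of Cavnar and Trenkle's out-of-place measure). This is an assumption about the baseline's general shape, not the paper's exact implementation; all function names and parameters here are illustrative.

```python
from collections import Counter

def ngram_profile(text, n_max=3, top_k=300):
    """Rank character n-grams (lengths 1..n_max) by frequency, most frequent first.
    Returns a mapping from n-gram to its rank in the profile."""
    counts = Counter()
    padded = f" {text.lower()} "
    for n in range(1, n_max + 1):
        for i in range(len(padded) - n + 1):
            counts[padded[i:i + n]] += 1
    ranked = [gram for gram, _ in counts.most_common()]
    return {gram: rank for rank, gram in enumerate(ranked[:top_k])}

def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences; n-grams absent from the language profile
    incur a fixed maximum penalty."""
    penalty = len(lang_profile)
    return sum(abs(rank - lang_profile.get(gram, penalty))
               for gram, rank in doc_profile.items())

def identify(text, training_texts):
    """Return the language whose n-gram profile is closest to the text's profile."""
    doc = ngram_profile(text)
    profiles = {lang: ngram_profile(t) for lang, t in training_texts.items()}
    return min(profiles, key=lambda lang: out_of_place(doc, profiles[lang]))
```

Because such profiles are built from raw n-gram frequency ranks, they become unreliable when the input is only a few words long, which is precisely the shortcoming that motivates the neural model above.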