...
首页> 外文期刊>Expert systems with applications >Hierarchical and lateral multiple timescales gated recurrent units with pre-trained encoder for long text classification
【24h】

Hierarchical and lateral multiple timescales gated recurrent units with pre-trained encoder for long text classification

机译:具有预先培训的编码器的分层和横向多个时间尺寸,用于长文本分类

获取原文
获取原文并翻译 | 示例
           

摘要

Text classification, using deep learning techniques, has become a research challenge in natural language processing. Most of the existing deep learning models for text classification face difficulties when the length of the input text increases. Most models work well on shorter text inputs, however, their performance degrades with the increase in the input length. In this work, we introduce a model for text classification that can alleviate this problem. We present the hierarchical and lateral multiple timescales gated recurrent units (HL-MTGRU), in combination with pre-trained encoders to address the long text classification problem. HL-MTGRU can represent multiple temporal scale dependencies for the discrimination task. By combining the slow and fast units of the HL-MTGRU, our model effectively classifies long multi-sentence texts into the desired classes. We also show that the HL-MTGRU structure helps the model to prevent degradation of performance on longer text inputs. We demonstrate that the proposed network with the help of the latest pre-trained encoders for feature extraction outperforms the conventional models on various long text classification benchmark datasets.
机译:文本分类,利用深度学习技术,已成为自然语言处理中的研究挑战。当输入文本的长度增加时,大多数用于文本分类的深度学习模型面临困难。大多数模型在短文本输入上运行良好,但它们的性能随着输入长度的增加而劣化。在这项工作中,我们介绍了一种可以缓解此问题的文本分类模型。我们介绍了分层和横向多个时间尺寸门控复发单元(HL-MTGRU),与预先培训的编码器组合以解决长文本分类问题。 HL-MTGRU可以表示歧视任务的多个时间尺度依赖性。通过组合HL-MTGRU的慢速和快速单元,我们的模型有效地将长的多句子文本分类为所需的类。我们还表明HL-MTGRU结构有助于模型,以防止在更长的文本输入上降低性能。我们展示了具有用于特征提取的最新预先训练的编码器的提出的网络优先于各种长文本分类基准数据集中的传统模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号