首页> 外文期刊>Neurocomputing >Finding decision jumps in text classification
【24h】

Finding decision jumps in text classification

机译:寻找文本分类中的决策跳跃

获取原文
获取原文并翻译 | 示例

摘要

Text classification is one of the key problems in natural language processing (NLP), and in early years, it was usually accomplished by feature-based machine learning models. Recently, the deep neural network has become a powerful learning machine, making it possible to work with text itself as raw input for the classification problems. However, existing neural networks are typically end-to-end and lack explicit interpretation of the prediction. In this paper, we propose JUMPER, a novel framework that models text classification as a sequential decision process. Generally, JUMPER is a neural system that scans a piece of text sequentially and makes classification decisions at the time it wishes, which is inspired by the cognitive process of human text reading. In our framework, both the classification result and when to make the classification are part of the decision process, controlled by a policy network and trained with reinforcement learning. Experimental results of real-world applications demonstrate the following properties of a properly trained JUMPER: (1) it tends to make decisions whenever the evidence is enough, therefore reducing total text reading by 30-40% and often finding the key rationale of the prediction; and (2) it achieves classification accuracy better than or comparable to state-of-the-art models in several benchmark and industrial datasets. We further conduct a simulation experiment with mock data, which confirms that JUMPER is able to make a decision at the theoretically optimal decision position. (C) 2019 Elsevier B.V. All rights reserved.
机译:文本分类是自然语言处理(NLP)中的关键问题之一,在早期,它通常是通过基于特征的机器学习模型来完成的。近年来,深度神经网络已成为功能强大的学习机器,从而有可能将文本本身作为分类问题的原始输入。但是,现有的神经网络通常是端到端的,缺乏对预测的明确解释。在本文中,我们提出了JUMPER,这是一个将文本分类建模为顺序决策过程的新颖框架。通常,JUMPER是一种神经系统,可以顺序扫描一段文本并在希望的时间做出分类决定,这受人类文本阅读的认知过程的启发。在我们的框架中,分类结果和何时进行分类都是决策过程的一部分,受策略网络控制并接受强化学习培训。实际应用程序的实验结果表明,经过适当训练的JUMPER具有以下特性:(1)它倾向于在证据充分时做出决策,因此将总文本阅读量减少30-40%,并且经常找到预测的关键原理; (2)在几个基准数据和工业数据集中,其分类精度优于或与最新模型相当。我们进一步使用模拟数据进行了模拟实验,这证实了JUMPER能够在理论上最佳的决策位置做出决策。 (C)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号