首页> 外文会议>Conference on empirical methods in natural language processing >Understanding Convolutional Neural Networks for Text Classification
【24h】

Understanding Convolutional Neural Networks for Text Classification

机译:了解文本分类的卷积神经网络

获取原文

摘要

We present an analysis into the inner workings of Convolutional Neural Networks (CNNs) for processing text. CNNs used for computer vision can be interpreted by projecting filters into image space, but for discrete sequence inputs CNNs remain a mystery. We aim to understand the method by which the networks process and classify text. We examine common hypotheses to this problem: that filters, accompanied by global max-pooling, serve as ngram detectors. We show that filters may capture several different semantic classes of ngrams by using different activation patterns, and that global max-pooling induces behavior which separates important ngrams from the rest. Finally, we show practical use cases derived from our findings in the form of model interpretability (explaining a trained model by deriving a concrete identity for each filter, bridging the gap between visualization tools in vision tasks and NLP) and prediction interpretability (explaining predictions).
机译:我们对卷积神经网络(CNNS)的内部运作进行了分析,用于处理文本。用于计算机视觉的CNN可以通过将过滤器投影到图像空间中来解释,但对于离散序列,CNNS仍然是一个谜。我们的目标是理解网络流程和分类文本的方法。我们将常见的假设审视此问题:伴随全局最大池的过滤器,用作ngram检测器。我们显示过滤器可以通过使用不同的激活模式来捕获几种不同的语义类别,并且全局最大池引起从其余部分分开重要Ngrams的行为。最后,我们展示了模型解释性形式的实用用例,以模型解释性的形式(通过导出每个过滤器的具体身份来解释训练的模型,拓展了视觉任务和NLP中的可视化工具之间的差距和预测解释性(解释预测) 。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号