...
首页> 外文期刊>Applied Soft Computing >Deep neural network for hierarchical extreme multi-label text classification
【24h】

Deep neural network for hierarchical extreme multi-label text classification

机译:深度神经网络,用于分层极端多标签文本分类

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The classification of natural language texts has gained a growing importance in many real world applications due to its significant implications in relation to crucial tasks, such as Information Retrieval, Question Answering, Text Summarization, Natural Language Understanding. In this paper we present an analysis of a Deep Learning architecture devoted to text classification, considering the extreme multi-class and multi-label text classification problem, when a hierarchical label set is defined. The paper presents a methodology named Hierarchical Label Set Expansion (HLSE), used to regularize the data labels, and an analysis of the impact of different Word Embedding (WE) models that explicitly incorporate grammatical and syntactic features. We evaluate the aforementioned methodologies on the PubMed scientific articles collection, where a multi-class and multi-label text classification problem is defined with the Medical Subject Headings (MeSH) label set, a hierarchical set of 27,775 classes. The experimental assessment proves the usefulness of the proposed HLSE methodology and also provides some interesting results relating to the impact of different uses and combinations of WE models as input to the neural network in this kind of application. (C) 2019 Elsevier B.V. All rights reserved.
机译:由于其与关键任务的重要意义,自然语言文本的分类在许多现实世界应用中取得了越来越重要的意义,例如信息检索,问题回答,文本摘要,自然语言理解。在本文中,考虑到定义分层标签集时,我们展示了专门致力于文本分类的深度学习架构的分析。本文介绍了一个名为分层标签集扩展(HLSE)的方法,用于对数据标签进行正规化,以及分析不同单词嵌入(我们)模型的模型,明确地包含了语法和语法特征。我们评估了上述关于PubMed Scientific文章集合的方法,其中,使用医疗主题标题(网格)标签集,分层组27,775类定义了多级和多标签文本分类问题。实验评估证明了拟议的HLSE方法的有用性,并提供了与不同用途的影响以及我们模型的影响与这种应用中的神经网络的影响有关的一些有趣结果。 (c)2019年Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号