首页> 外文会议>International Conference on Advanced Computing >Performance Evaluation of Deep Learning Algorithms in Biomedical Document Classification
【24h】

Performance Evaluation of Deep Learning Algorithms in Biomedical Document Classification

机译:深度学习算法在生物医学文献分类中的性能评估

获取原文

摘要

Document classification is a prevalent task in Natural Language Processing (NLP), which has an extensive range of applications in the biomedical domains such as biomedical literature indexing, automatic diagnosis codes assignment, tweets classification for public health topics, and patient safety reports classification. Nevertheless, manual classification of biomedical articles published every year into specific predefined categories becomes a cumbersome task. Hence, building an automatic document classification for biomedical databases emerges as a significant task among the scientific community. In recent years, Deep Learning (DL) models like Deep Neural Networks (DNN), Convolution Neural Networks (CNN), Recurrent Neural Networks (RNN), and Ensemble Deep Learning models are widely used in the area of text document classification for better classification performance compared to Machine Learning (ML) algorithms. The major advantage of using DL models in document classification is that it provides rich semantic and grammatical information for document representation through pre-trained word embedding. Hence, this paper investigates the deployment of the various state-of-the-art DL based classification models in automatic classification of benchmark biomedical datasets. Finally, the performance of all the aforementioned constitutional classifiers is compared and evaluated through the well-defined performance evaluation metrics such as accuracy, precision, recall, and f1-measure.
机译:文档分类是自然语言处理(NLP)中的一项普遍任务,它在生物医学领域中具有广泛的应用,例如生物医学文献索引,自动诊断代码分配,针对公共卫生主题的推文分类以及患者安全报告分类。然而,每年出版的生物医学文章的人工分类到特定的预定义类别成为繁琐的任务。因此,为生物医学数据库建立自动文档分类成为科学界的一项重要任务。近年来,深度学习(DL)模型(如深度神经网络(DNN),卷积神经网络(CNN),递归神经网络(RNN)和集成深度学习模型)广泛用于文本文档分类领域,以实现更好的分类性能与机器学习(ML)算法相比。在文档分类中使用DL模型的主要优势在于,它通过预先训练的词嵌入为文档表示提供了丰富的语义和语法信息。因此,本文研究了在基准生物医学数据集的自动分类中各种基于DL的最新分类模型的部署。最后,通过定义明确的性能评估指标(例如准确性,精度,召回率和f1-measure)比较和评估所有上述组成分类器的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号