首页> 外文期刊>Earth Science Informatics >BiLSTM-CRF for geological named entity recognition from the geoscience literature
【24h】

BiLSTM-CRF for geological named entity recognition from the geoscience literature

机译:Bilstm-CRF从地球科学文献中获得地质名称实体识别

获取原文
获取原文并翻译 | 示例
           

摘要

Many detailed geoscience reports lie unused, offering both challenges and opportunities for information extraction. In geoscience research, geological named entity recognition (GNER) is an important task in the field of geoscience information extraction. Regarding numerical geoscience data, research on information extraction remains limited. Most conventional NER approaches are heavily dependent on feature engineering, and such sentence-level-based methods suffer from the tagging inconsistency problem. Based on the above observations, this paper proposes a neural network approach, namely, attention-based bidirectional long short-term memory with a conditional random field layer (Att-BiLSTM-CRF), for name entity recognition to extract information entities describing geoscience information from geoscience reports. This approach leverages global information learned from an attention mechanism to enforce tagging consistency across multiple instances of the same token in a document. Experiments on the constructed dataset show that our method achieves comparable performance to that of other state-of-the-art systems. Additionally, our method achieved an average F1 score of 91.47% in the NER extraction task.
机译:许多详细的地球科学报告谎言未使用,为信息提取提供挑战和机会。在地球科学研究中,地质命名实体识别(GNER)是地球科学信息提取领域的重要任务。关于数字地球科学数据,信息提取研究仍然有限。大多数传统的NER方法都依赖于特征工程,并且这种基于句子级的方法遭受标记不一致问题。基于上述观察,本文提出了一种神经网络方法,即,具有条件随机场层(ATT-Bilstm-CRF)的关注的双向长期内记忆,用于提取描述地球科学信息的信息实体来自地球科学报告。此方法利用了从注意机制中学到的全局信息来强制在文档中的同一令牌的多个实例上强制标记标记一致性。构造数据集的实验表明,我们的方法对其他最先进系统的性能达到了可比性。此外,我们的方法在NER提取任务中实现了91.47%的平均F1得分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号