首页> 外文期刊>Sensors and materials >Rapid Extraction of Research Areas from Scientific and Technological Literature
【24h】

Rapid Extraction of Research Areas from Scientific and Technological Literature

机译:科技文学研究领域的快速提取

获取原文
获取原文并翻译 | 示例
           

摘要

Along with the rapid development of Internet Plus, big data, and other technologies, the construction of smart cities is promoting the transformation and upgrading of mapping geographic information models from traditional information services to intelligent services with spatial sensing. At present, however, most of the knowledge needed to provide intelligent services is implicit in the form of unstructured text in various books and journal papers in related fields, which is difficult to capture, use, analyze, and share. In particular, geographical feature knowledge is one of the types of knowledge that needs to be extracted urgently. To solve this problem, in this paper, we propose a method for the rapid extraction of research areas from scientific and technological literature abstracts. Firstly, with the help of a general naming entity identification tool, we propose a method of rapidly annotating place-name entities in administrative divisions. Then, combining the bidirectional long short-term memory conditional random field (BiLSTM-CRF) model with a place-name database covering five levels of administrative divisions in China, the identification, disambiguation, and relationship extraction of place names in different administrative divisions are realized. On this basis, the extraction of research areas is regarded as a two-classification problem, feature vectors such as frequency and location are constructed for the names of the extracted administrative divisions, and the classification model is constructed with the random forest algorithm to rapidly extract research areas. The experimental results show that the recognition accuracy of place names in administrative areas in this study is 92.61% and the recognition accuracy of research areas is 90.31%. The results are superior to those of similar algorithms; thus, the proposed method can accurately and rapidly extract research areas.
机译:随着互联网加,大数据等技术的快速发展,智能城市的建设正在推动传统信息服务将地理信息模型的转型和升级,以空间传感。然而,目前,提供智能服务所需的大多数知识是以各种书籍和杂志中的非结构化文本的形式隐含,这些文件中的相关领域难以捕获,使用,分析和分享。特别地,地理特征知识是需要紧急提取的知识类型之一。为了解决这个问题,在本文中,我们提出了一种迅速提取科技文学摘要研究领域的方法。首先,在一般命名实体识别工具的帮助下,我们提出了一种在行政区划中快速注释地名实体的方法。然后,将双向短期内记忆条件随机字段(Bilstm-CRF)模型与涵盖中国五个行政部门的地方名称数据库,识别,消歧和关系提取不同行政区的地方意识到。在此基础上,研究领域的提取被认为是一个双分类问题,为频率和位置等特征向量构造了提取的管理分区的名称,并且分类模型用随机林算法进行了快速提取研究领域。实验结果表明,本研究行政区域中的地方名称的识别准确性为92.61%,研究区域的识别准确性为90.31%。结果优于类似算法的结果;因此,所提出的方法可以准确迅速提取研究领域。

著录项

  • 来源
    《Sensors and materials》 |2020年第12期|4489-4504|共16页
  • 作者单位

    Beijing University of Civil Engineering and Architecture School of Geomatics and Urban Spatial Informatics Beijing 102616 China;

    National Geomatics Centre of China Information Service Department Beijing 100830 China;

    Capital Normal University College of Resource Environment and Tourism Beijing 100048 China;

    National Geomatics Centre of China Information Service Department Beijing 100830 China;

    Beijing University of Civil Engineering and Architecture School of Geomatics and Urban Spatial Informatics Beijing 102616 China;

    Beijing University of Civil Engineering and Architecture School of Geomatics and Urban Spatial Informatics Beijing 102616 China;

    Beijing University of Civil Engineering and Architecture School of Geomatics and Urban Spatial Informatics Beijing 102616 China;

  • 收录信息 美国《科学引文索引》(SCI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    smart city; knowledge extraction; study area extraction; BiLSTM-CRF; random forest model;

    机译:聪明的城市;知识提取;研究区域提取;Bilstm-CRF;随机森林模型;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号