首页> 外文会议>Workshop on Scholarly Document Processing >The Biomaterials Annotator: a system for ontology-based concept annotation of biomaterials text
【24h】

The Biomaterials Annotator: a system for ontology-based concept annotation of biomaterials text

机译:生物材料注释器:基于本体的生物材料文本概念注释系统

获取原文
获取外文期刊封面目录资料

摘要

Biomaterials are synthetic or natural materials used for constructing artificial organs, fabricating prostheses, or replacing tissues. The last century saw the development of thousands of novel biomaterials and, as a result, an exponential increase in scientific publications in the field. Large-scale analysis of biomaterials and their performance could enable data-driven material selection and implant design. However, such analysis requires identification and organization of concepts, such as materials and structures, from published texts. To facilitate future information extraction and the application of machine-learning techniques, we developed a semantic annotator specifically tailored for the biomaterials literature. The Biomaterials Annotator has been implemented following a modular organization using software containers for the different components and orchestrated using Nextflow as workflow manager. Natural language processing (NLP) components are mainly developed in Java. This set-up has allowed named entity recognition of seventeen classes relevant to the biomaterials domain. Here we detail the development, evaluation and performance of the system, as well as the release of the first collection of annotated biomaterials abstracts. We make both the corpus and system available to the community to promote future efforts in the field and contribute towards its sustainability.
机译:生物材料是用于构建人工器官、制造假体或替换组织的合成或天然材料。上个世纪见证了数千种新型生物材料的发展,因此,该领域的科学出版物呈指数级增长。对生物材料及其性能的大规模分析可以实现数据驱动的材料选择和植入物设计。然而,这种分析需要从出版的文本中识别和组织概念,如材料和结构。为了促进未来信息提取和机器学习技术的应用,我们开发了一个专门针对生物材料文献的语义注释器。Biomaterials Annotator按照模块化组织实施,使用不同组件的软件容器,并使用Nextflow作为工作流管理器进行编排。自然语言处理(NLP)组件主要是用Java开发的。这种设置允许命名实体识别与生物材料领域相关的17个类别。这里我们详细介绍了系统的开发、评估和性能,以及第一批带注释的生物材料摘要的发布。我们向社区提供语料库和系统,以促进该领域未来的努力,并为其可持续性做出贡献。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号