首页> 外文期刊>Advances in Distributed Computing And Artificial Intelligence Journal >Applying a text mining framework to the extraction of numerical parameters from scientific literature in the biotechnology domain
【24h】

Applying a text mining framework to the extraction of numerical parameters from scientific literature in the biotechnology domain

机译:将文本挖掘框架应用于从生物技术领域的科学文献中提取数值参数

获取原文
获取外文期刊封面目录资料

摘要

Scientific publications are the main vehicle to disseminate information in the field of biotechnology for wastewater treatment. Indeed, the new research paradigms and the application of high-throughput technologies have increased the rate of publication considerably. The problem is that manual curation becomes harder, prone-to-errors and time-consuming, leading to a probable loss of information and inefficient knowledge acquisition. As a result, research outputs are hardly reaching engineers, hampering the calibration of mathematical models used to optimize the stability and performance of biotechnological systems. In this context, we have developed a data curation workflow, based on text mining techniques, to extract numerical parameters from scientific literature, and applied it to the biotechnology domain. A workflow was built to process wastewater-related articles with the main goal of identifying physico-chemical parameters mentioned in the text. This work describes the implementation of the workflow, identifies achievements and current limitations in the overall process, and presents the results obtained for a corpus of 50 full-text documents.
机译:科学出版物是传播用于废水处理的生物技术领域信息的主要工具。实际上,新的研究范式和高通量技术的应用大大提高了发表率。问题是手动管理变得更加困难,容易出错并且耗时,导致信息丢失和知识获取效率低下。结果,研究成果几乎没有到达工程师手中,这妨碍了用于优化生物技术系统稳定性和性能的数学模型的校准。在这种情况下,我们已经开发了一种基于文本挖掘技术的数据管理工作流,以从科学文献中提取数值参数,并将其应用于生物技术领域。建立了一个工作流程来处理与废水有关的物品,其主要目的是确定本文中提到的理化参数。这项工作描述了工作流程的实现,确定了整个过程中的成就和当前的局限性,并介绍了从50个全文本文档集中获得的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号