首页> 外文会议>International Conference on Practical Applications of Computational Biology Bioinformatics >Development of Text Mining Tools for Information Retrieval from Patents
【24h】

Development of Text Mining Tools for Information Retrieval from Patents

机译:从专利中检索信息检索的文本挖掘工具的开发

获取原文

摘要

Biomedical literature is composed of an ever increasing number of publications in natural language. Patents are a relevant fraction of those, being important sources of information due to all the curated data from the granting process. However, their unstructured data turns the search of information a challenging task. To surpass that, Biomedical text mining (BioTM) creates methodologies to search and structure that data. Several BioTM techniques can be applied to patents. From those, Information Retrieval is the process where relevant data is obtained from collections of documents. In this work, a patent pipeline was developed and integrated into @Note2, an open-source computational framework for BioTM. This integration allows to run further BioTM tools over the patent documents, including Information Extraction processes as Named Entity Recognition or Relation Extraction.
机译:生物医学文献由越来越多的自然语言出版物组成。专利是一种相关的部分,是由于来自授权过程的所有策划数据,是信息的重要信息。但是,它们的非结构化数据会导致信息的搜索成为一个具有挑战性的任务。要超越,生物医学文本挖掘(Biotm)会创建方法,以搜索和结构该数据。可以应用几种Biotm技术。从那些,信息检索是从文件集合获得相关数据的过程。在这项工作中,开发了专利管线并集成到@ Note2中,是Biotm的开源计算框架。该集成允许通过专利文档运行其他生物工具,包括作为命名实体识别或关系提取的信息提取过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号