...
首页> 外文期刊>Genomics & Informatics >Using the PubAnnotation ecosystem to perform agile text mining on Genomics & Informatics: a tutorial review
【24h】

Using the PubAnnotation ecosystem to perform agile text mining on Genomics & Informatics: a tutorial review

机译:使用Pubannotation Ecosystem在基因组学和信息学上执行敏捷文本挖掘:教程评论

获取原文
           

摘要

The prototype version of the full-text corpus of Genomics & Informatics has recently been archived in a GitHub repository. The full-text publications of volumes 10 through 17 are also directly downloadable from PubMed Central (PMC) as XML files. During the Biomedical Linked Annotation Hackathon 6 (BLAH6), we experimented with converting, annotating, and updating 301 PMC full-text articles of Genomics & Informatics using PubAnnotation, a system that provides a convenient way to add PMC publications based on PMCID. Thus, this review aims to provide a tutorial overview of practicing the iterative task of named entity recognition with the PubAnnotation/PubDictionaries/TextAE ecosystem. We also describe developing a conversion tool between the Genia tagger output and the JSON format of PubAnnotation during the hackathon.
机译:最近在GitHub存储库中存档了基因组学和信息学的全文语料库的原型版本。卷10到17的全文出版物也是从PubMed Central(PMC)作为XML文件下载。在生物医学链接的注释Hackathon 6(Blah6)期间,我们使用Pubannotation进行了转换,注释和更新301 PMC全文文章的基因组学和信息学,该系统提供了基于PMCID添加PMC出版物的便捷方式。因此,本综述旨在提供与Pubannotation / Pubdictionaries / Textae生态系统一起练习命名实体识别的迭代任务的教程概述。我们还描述了在Hackathon期间,在Genia标签输出和Pubannotation的JSON格式之间开发转换工具。

著录项

获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号