首页> 外文会议>Conference on empirical methods in natural language processing;International workshop on health text mining and information analysis >Unsupervised Identification of Study Descriptors in Toxicology Research: An Experimental Study
【24h】

Unsupervised Identification of Study Descriptors in Toxicology Research: An Experimental Study

机译:毒理学研究中研究指标的无监督鉴定:一项实验研究

获取原文

摘要

Identifying and extracting data elements such as study descriptors in publication full texts is a critical yet manual and labor-intensive step required in a number of tasks. In this paper we address the question of identifying data elements in an unsupervised manner. Specifically, provided a set of criteria describing specific study parameters, such as species, route of administration, and dosing regimen, we develop an unsupervised approach to identify text segments (sentences) relevant to the criteria. A binary classifier trained to identify publications that met the criteria performs better when trained on the candidate sentences than when trained on sentences randomly picked from the text, supporting the intuition that our method is able to accurately identify study descriptors.
机译:识别和提取出版物全文中的研究描述符之类的数据元素是许多任务中必不可少的关键而又需要人工和劳动的步骤。在本文中,我们以无人监督的方式解决了识别数据元素的问题。具体来说,提供了一组描述特定研究参数的标准,例如种类,给药途径和给药方案,我们开发了一种无监督的方法来识别与该标准相关的文本片段(句子)。经过训练以识别符合条件的出版物的二元分类器,在对候选句子进行训练时比在从文本中随机选取的句子进行训练时,表现更好,这支持了我们的方法能够准确识别研究描述符的直觉。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号