首页> 外文期刊>BMC Medical Informatics and Decision Making >An approach for transgender population information extraction and summarization from clinical trial text
【24h】

An approach for transgender population information extraction and summarization from clinical trial text

机译:一种从临床试验文本中提取和总结跨性别人群信息的方法

获取原文
           

摘要

Gender information frequently exists in the eligibility criteria of clinical trial text as essential information for participant population recruitment. Particularly, current eligibility criteria text contains the incompleteness and ambiguity issues in expressing transgender population, leading to difficulties or even failure of transgender population recruitment in clinical trial studies. A new gender model is proposed for providing comprehensive transgender requirement specification. In addition, an automated approach is developed to extract and summarize gender requirements from unstructured text in accordance with the gender model. This approach consists of: 1) the feature extraction module, and 2) the feature summarization module. The first module identifies and extracts gender features using heuristic rules and automatically-generated patterns. The second module summarizes gender requirements by relation inference. Based on 100,134 clinical trials from ClinicalTrials.gov , our approach was compared with 20 commonly applied machine learning methods. It achieved a macro-averaged precision of 0.885, a macro-averaged recall of 0.871 and a macro-averaged F1-measure of 0.878. The results illustrated that our approach outperformed all baseline methods in terms of both commonly used metrics and macro-averaged metrics. This study presented a new gender model aiming for specifying the transgender requirement more precisely. We also proposed an approach for gender information extraction and summarization from unstructured clinical text to enhance transgender-related clinical trial population recruitment. The experiment results demonstrated that the approach was effective in transgender criteria extraction and summarization.
机译:性别信息经常出现在临床试验文本的资格标准中,作为招募参与者人群的基本信息。特别是,当前的资格标准文本包含表达跨性别人群的不完整和含糊不清的问题,导致在临床试验研究中招募跨性别人群的困难甚至失败。提出了一种新的性别模型,以提供全面的跨性别需求规范。此外,根据性别模型,开发了一种自动方法来从非结构化文本中提取和总结性别要求。该方法包括:1)特征提取模块,和2)特征汇总模块。第一个模块使用启发式规则和自动生成的模式来识别和提取性别特征。第二个模块通过关系推断总结了性别要求。基于ClinicalTrials.gov的100,134个临床试验,我们的方法与20种常用的机器学习方法进行了比较。它实现了0.885的宏平均精度,0.871的宏平均召回率和0.878的宏平均F1量度。结果表明,就常用指标和宏观平均指标而言,我们的方法均优于所有基准方法。这项研究提出了一种新的性别模型,旨在更精确地确定变性需求。我们还提出了一种从非结构化临床文本中提取和汇总性别信息的方法,以增强跨性别相关的临床试验人群的招募。实验结果表明,该方法在变性标准的提取和总结中是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号