首页> 外文会议>IEEE International Conference on Healthcare Informatics >Analyzing Patterns of Literature-Based Phenotyping Definitions for Text Mining Applications
【24h】

Analyzing Patterns of Literature-Based Phenotyping Definitions for Text Mining Applications

机译:分析文本挖掘应用中基于文献的表型定义的模式

获取原文

摘要

Phenotyping definitions are widely used in observational studies that utilize population data from Electronic Health Records (EHRs). Biomedical text mining supports biomedical knowledge discovery. Therefore, we believe that mining phenotyping definitions from the literature can support EHR-based clinical research. However, information about these definitions presented in the literature is inconsistent, diverse, and unknown, especially for text mining usage. Therefore, we aim to analyze patterns of phenotyping definitions as a first step toward developing a text mining application to improve phenotype definition. A set random of observational studies was used for this analysis. Term frequency-inverse document frequency (TF-IDF) and Term Frequency (TF) were used to rank the terms in the 3958 sentences. Finally, we present preliminary results analyzing phenotyping definitions patterns.
机译:表型定义广泛用于利用电子健康记录(EHR)的人口数据进行的观察研究中。生物医学文本挖掘支持生物医学知识发现。因此,我们认为从文献中挖掘表型定义可以支持基于EHR的临床研究。但是,文献中提供的有关这些定义的信息是不一致,多样化和未知的,尤其是对于文本挖掘的使用而言。因此,我们旨在分析表型定义的模式,作为开发文本挖掘应用程序以改善表型定义的第一步。一组随机观察研究用于该分析。术语频率逆文档频率(TF-IDF)和术语频率(TF)用于对3958个句子中的术语进行排名。最后,我们提出了分析表型定义模式的初步结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号