首页> 美国卫生研究院文献>AMIA Annual Symposium Proceedings >Throw the Bath Water Out Keep the Baby: Keeping Medically-Relevant Terms for Text Mining
【2h】

Throw the Bath Water Out Keep the Baby: Keeping Medically-Relevant Terms for Text Mining

机译:扔掉洗澡水养婴儿:保留医学上与文本挖掘相关的术语

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The purpose of this research is to answer the question, can medically-relevant terms be extracted from text notes and text mined for the purpose of classification and obtain equal or better results than text mining the original note? A novel method is used to extract medically-relevant terms for the purpose of text mining. A dataset of 5,009 EMR text notes (1,151 related to falls) was obtained from a Veterans Administration Medical Center. The dataset was processed with a natural language processing (NLP) application which extracted concepts based on SNOMED-CT terms from the Unified Medical Language System (UMLS) Metathesaurus. SAS Enterprise Miner was used to text mine both the set of complete text notes and the set represented by the extracted concepts. Logistic regression models were built from the results, with the extracted concept model performing slightly better than the complete note model.
机译:这项研究的目的是回答这个问题,是否可以从文本笔记和为分类目的而提取的文本中提取与医学相关的术语,并获得与挖掘原始注释相同或更好的结果?为了文本挖掘的目的,使用了一种新颖的方法来提取医学上相关的术语。从退伍军人管理局医学中心获得了5,009个EMR文本注释(与瀑布有关的1,151个)的数据集。使用自然语言处理(NLP)应用程序处理数据集,该应用程序从统一医学语言系统(UMLS)元同义词库中提取基于SNOMED-CT术语的概念。 SAS Enterprise Miner用于文本挖掘完整文本注释集和提取概念所代表的文本集。从结果构建逻辑回归模型,提取的概念模型的性能略好于完整的音符模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号