首页> 外文会议>Conference on Artificial Intelligence in Medicine >Generating Positive Psychosis Symptom Keywords from Electronic Health Records
【24h】

Generating Positive Psychosis Symptom Keywords from Electronic Health Records

机译:从电子病历中生成阳性精神病症状关键字

获取原文

摘要

The development of Natural Language Processing (NLP) solutions for information extraction from electronic health records (EHRs) has grown in recent years, as most clinically relevant information in EHRs is documented only in free text. One of the core tasks for any NLP system is to extract clinically relevant concepts such as symptoms. This information can then be used for more complex problems such as determining symptom onset, which requires temporal information. In the mental health domain, comprehensive vocabularies for specific disorders are scarce, and rarely contain keywords that reflect real-world terminology use. We explore the use of embedding techniques to automatically generate lexical variants of psychosis symptoms into vocabularies, that can be used in complex downstream NLP tasks. We study the impact of the underlying text material on generating useful lexical entries, experimenting with different corpora and with unigram/bigram models. We also propose a method to automatically compute thresholds for choosing the most relevant terms. Our main contribution is a systematic study of unsupervised vocabulary generation using different corpora for an understudied clinical use-case. Resulting lexicons are publicly available.
机译:近年来,用于从电子健康记录(EHR)中提取信息的自然语言处理(NLP)解决方案的发展不断增长,因为EHR中大多数与临床相关的信息仅以自由文本记录。任何NLP系统的核心任务之一是提取临床相关概念,例如症状。然后可以将此信息用于更复杂的问题,例如确定症状发作,这需要时间信息。在心理健康领域,针对特定疾病的综合词汇很少,并且很少包含反映现实世界中术语使用的关键字。我们探索使用嵌入技术自动将精神病症状的词汇变体生成词汇表,并将其用于复杂的下游NLP任务中。我们研究了基础文本材料对生成有用词汇条目的影响,并尝试了不同的语料库和单字组/二字组模型。我们还提出了一种自动计算用于选择最相关术语的阈值的方法。我们的主要贡献是针对未充分研究的临床用例使用不同语料库对无监督词汇生成进行系统研究。最终的词典是公开可用的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号