首页> 外文期刊>Methods of information in medicine >Automatic DPC code selection from electronic medical records: text mining trial of discharge summary.
【24h】

Automatic DPC code selection from electronic medical records: text mining trial of discharge summary.

机译:从电子病历中自动选择DPC代码:文本摘要的出院试验摘要。

获取原文
获取原文并翻译 | 示例
           

摘要

OBJECTIVES: We extracted index terms related to diseases recorded in hospital discharge summaries and examined the capability of the vector space model to select a suitable diagnosis with these terms. METHODS: By morphological analysis, we extracted index terms and constructed an original dictionary for the discharge summary analysis. We chose 125 different DPC (Japanese DRG system) codes for the diseases, each of which had more than 20 cases. We divided them into two groups. One group consisted of 5927 cases from 2004 fiscal year and was used to generate the document vector space according to the DPC. The other group of 3187 cases was collected to verify the automatic DPC selection by using data from 2005 fiscal year. The top 200 extracted index terms for each disease were used to calculate the weight of each disease. RESULTS: The DPC code obtained by the calculated similarity was compared with the original codes of patients for 125 DPCs of 3187 cases. Eighty percent of the cases matched the diagnosis of the DPC (first six digits) and 56% of the cases completely matched all 14 digits of the DPC. CONCLUSIONS: We demonstrated that we could extract suitable terms for each disease and obtain characteristics, such as the diagnosis, from the calculated vectors. This technique can be used to measure the qualification of discharge summaries and to integrate discharge summaries among different facilities. By the text mining technique, we can characterize the contents of electronic discharge summaries and deduce diagnoses with the data.
机译:目的:我们提取了与出院总结中记录的疾病相关的索引术语,并检查了向量空间模型选择具有这些术语的合适诊断的能力。方法:通过形态学分析,我们提取了索引项并构建了用于排放汇总分析的原始字典。我们为疾病选择了125种不同的DPC(日本DRG系统)代码,每种代码都有20多个病例。我们将它们分为两组。一组来自2004财政年度的5927个案例,并根据DPC用于生成文档向量空间。收集了其他3187个案例,以使用2005财政年度的数据来验证DPC的自动选择。每种疾病的前200个提取索引项用于计算每种疾病的权重。结果:将计算出的相似度获得的DPC码与3187例125个DPC的患者原始码进行比较。 80%的病例符合DPC的诊断(前六位数字),56%的病例完全符合DPC的所有14位数字。结论:我们证明了我们可以为每种疾病提取合适的术语,并从计算出的载体中获得特征,例如诊断。此技术可用于测量排放汇总的资格,并整合不同设施之间的排放汇总。通过文本挖掘技术,我们可以表征电子放电摘要的内容,并根据数据得出诊断。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号