Increasing secondary diagnosis encoding quality using data mining techniques


Abstract

To measure medical activity, hospitals are required to manually encode information about each inpatient episode using the International Classification of Diseases (ICD-10). This task is time-consuming and requires substantial staff training. We propose to speed up and facilitate the tedious task of coding patient information, especially for secondary diagnoses that are not well described in medical resources such as discharge letters and medical records. Our approach uses data mining techniques to explore medical databases of previously encoded secondary diagnoses and uses the stored structured information (age, gender, diagnosis count, medical procedures...) to build a decision tree that either assigns the proper secondary diagnosis code to the corresponding inpatient episode or flags inpatient episodes that contain implausible secondary diagnoses. The results suggest that better performance can be achieved by using a low level of diagnosis granularity together with filters that balance the distribution of negative and positive examples in the training set. The evaluation scores vary widely across the studied diagnoses: the highest F1 score is 75% and the lowest is 25%, which indicates that further enhancements are needed to achieve good performance regardless of the encoded diagnosis. However, the average accuracy across all studied secondary diagnoses is around 80%, which indicates better negative predictions; the approach could therefore be useful for preventing or detecting wrong coding assignments of secondary diagnoses in the inpatient stay.
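The gap between the F1 scores and the average accuracy reported in the abstract can be illustrated with a small sketch. The counts below are invented for illustration and are not taken from the paper. The function names (`accuracy`, `f1_score`, `undersample_negatives`) are hypothetical, and the undersampling helper is only one possible reading of the "filters" that balance negative and positive examples, not the authors' actual method.

```python
import random


def accuracy(tp, fp, fn, tn):
    """Share of all episodes (positive and negative) classified correctly."""
    total = tp + fp + fn + tn
    return (tp + tn) / total if total else 0.0


def f1_score(tp, fp, fn):
    """Harmonic mean of precision and recall; true negatives are ignored."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def undersample_negatives(rows, label_of, ratio=1.0, seed=0):
    """Keep every positive row and at most ratio * n_pos random negatives.

    Assumption: this is a generic balancing step, not the paper's filter.
    """
    pos = [r for r in rows if label_of(r) == 1]
    neg = [r for r in rows if label_of(r) == 0]
    keep = min(len(neg), int(ratio * len(pos)))
    return pos + random.Random(seed).sample(neg, keep)


if __name__ == "__main__":
    # Hypothetical confusion counts for one secondary diagnosis:
    # 100 episodes, 20 of which truly carry the code.
    tp, fp, fn, tn = 5, 5, 15, 75
    print("accuracy:", accuracy(tp, fp, fn, tn))
    print("F1:", f1_score(tp, fp, fn))

    # Hypothetical training rows: (episode_id, label), 20 positives, 80 negatives.
    rows = [(i, 1 if i < 20 else 0) for i in range(100)]
    balanced = undersample_negatives(rows, label_of=lambda r: r[1])
    print("balanced training size:", len(balanced))
```

Because accuracy counts true negatives and F1 does not, a classifier that mostly answers "code absent" on a rare diagnosis can score well on accuracy while scoring poorly on F1, which is consistent with the abstract's remark that the accuracy figure mainly reflects negative predictions.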


Bibliographic details

Source
International Conference on Research Challenges in Information Science, 2016, pp. 1-10 (10 pages)
Authors
Ghazar Chahbandarian; Nathalie Bricon-Souf; Rémi Bastide; Jean-Christoph Steinbach
Original format: PDF
Keywords
Medical diagnostic imaging; Data mining; Encoding; Databases; Feature extraction; Hospitals


