首页> 外文会议>IEEE International Conference on e-Health Networking, Applications and Services >Medical data mining: A case study of a Paracoccidioidomycosis patient's database
【24h】

Medical data mining: A case study of a Paracoccidioidomycosis patient's database

机译:医学数据挖掘:副球菌病患者数据库的案例研究

获取原文

摘要

Data mining applied to medical databases is a challenging process. The unavailability of large sources of data and data complexity are some of the difficulties encountered. This is especially true for rare and neglected diseases. Those databases are, in general, relatively small, wide and sparse, making them very challenging to analyze. There are also ethical, legal and social issues regarding privacy and clinical validation of the findings. This work proposes a way of dealing with this challenge with a case study of data mining applied in a Paracoccidioidomycosis (PCM) patients database. Paracoccidioidomycosis (PCM) is a typical Brazilian disease, caused by the yeast Paracoccidioides brasiliensis. This disease represents an important Public Health issue, due to its high incapacitating potential and the amount of premature deaths it causes if untreated. This paper discusses methods for the analysis of this complex dataset, to help increase the understanding of both the disease and this type of data. Despite the challenges of the dataset, some interesting findings were made being: flaws in form filling protocols, notably the lack of chest X-ray in 40% of the records; the discovery of a possible new relation between smoking habits and PCM evolution time. The average evolution time for smoking patients was 2.8 times longer; the successful classification/prediction of the cutaneous form of the disease with a 93% precision rate are some of the discoveries made.
机译:将数据挖掘应用于医疗数据库是一个具有挑战性的过程。大型数据源的不可用和数据复杂性是遇到的一些困难。对于罕见和被忽视的疾病尤其如此。这些数据库通常相对较小,范围广且稀疏,因此很难进行分析。关于发现的隐私和临床验证,还存在道德,法律和社会问题。这项工作提出了一种解决这种挑战的方法,该方法是通过在球菌类(PCM)患者数据库中应用数据挖掘的案例研究来进行的。副球菌病(PCM)是一种典型的巴西疾病,由巴西副球菌(Paracoccidioides brasiliensis)引起。这种疾病代表了一个重要的公共卫生问题,因为它具有很高的丧失工作能力,而且如果不加以治疗,会导致过早死亡。本文讨论了分析此复杂数据集的方法,以帮助增进对疾病和此类数据的了解。尽管数据集存在挑战,但还是取得了一些有趣的发现:表单填写协议中存在缺陷,尤其是40%的记录中缺少胸部X射线检查;发现吸烟习惯与PCM进化时间之间可能存在新的关系。吸烟患者的平均进化时间延长了2.8倍。成功地对疾病的皮肤形式进行分类/预测,准确率达93%,这是一些发现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号