首页> 外文期刊>Future generation computer systems >A novel data-driven robust framework based on machine learning and knowledge graph for disease classification
【24h】

A novel data-driven robust framework based on machine learning and knowledge graph for disease classification

机译:一种基于机器学习和知识图的数据驱动的鲁棒框架,用于疾病分类

获取原文
获取原文并翻译 | 示例
       

摘要

As Noncommunicable Diseases (NCDs) are affected or controlled by diverse factors such as age, regionalism, timeliness or seasonality, they are always challenging to be treated accurately, which has impacted on daily life and work of patients. Unfortunately, although a number of researchers have already made some achievements (including clinical or even computer-based) on certain diseases, current situation is eager to be improved via computer technologies such as data mining and Deep Learning. In addition, the progress of NCD research has been hampered by privacy of health and medical data. In this paper, a hierarchical idea has been proposed to study the effects of various factors on diseases, and a data-driven framework named d-DC with good extensibility is presented. d-DC is able to classify the disease according to the occupation on the premise where the disease is occurring in a certain region. During collecting data, we used a combination of personal or family medical records and traditional methods to build a data acquisition model. Not only can it realize automatic collection and replenishment of data, but it can also effectively tackle the cold start problem of the model with relatively few data effectively. The diversity of information gathering includes structured data and unstructured data (such as plain texts, images or videos), which contributes to improve the classification accuracy and new knowledge acquisition. Apart from adopting machine learning methods, d-DC has employed knowledge graph (KG) to classify diseases for the first time. The vectorization of medical texts by using knowledge embedding is a novel consideration in the classification of diseases. When results are singular, the medical expert system was proposed to address inconsistencies through knowledge bases or online experts. The results of d-DC are displayed by using a combination of KG and traditional methods, which intuitively provides a reasonable interpretation to the results (highly descriptive). Experiments show that d-DC achieved the improved accuracy than the other previous methods. Especially, a fusion method called RKRE based on both ResNet and the expert system attained an average correct proportion of 86.95%, which is a good feasibility study in the field of disease classification. (C) 2019 Elsevier B.V. All rights reserved.
机译:由于非传染性疾病(NCD)受年龄,区域性,及时性或季节性等多种因素的影响或控制,它们始终面临着难以准确治疗的挑战,这影响了患者的日常生活和工作。不幸的是,尽管许多研究人员已经在某些疾病上取得了一些成就(包括基于临床或什至基于计算机的成就),但仍希望通过诸如数据挖掘和深度学习之类的计算机技术来改善当前状况。此外,NCD研究的进展受到健康和医疗数据隐私的阻碍。本文提出了一种层次化的思想来研究各种因素对疾病的影响,并提出了一个具有良好可扩展性的数据驱动框架d-DC。 d-DC可以在特定地区发生疾病的前提下,根据职业将疾病分类。在收集数据期间,我们结合了个人或家庭病历和传统方法来建立数据采集模型。它不仅可以实现数据的自动收集和补充,还可以有效地解决数据量相对较少的模型的冷启动问题。信息收集的多样性包括结构化数据和非结构化数据(例如纯文本,图像或视频),这有助于提高分类准确性和获取新知识。除了采用机器学习方法外,d-DC还首次使用知识图(KG)对疾病进行分类。通过使用知识嵌入对医学文本进行矢量化是疾病分类中的一个新考虑。当结果是单一的时,建议使用医学专家系统来通过知识库或在线专家解决不一致问题。 d-DC的结果通过结合KG和传统方法来显示,从而直观地为结果提供了合理的解释(高度描述性)。实验表明,d-DC比其他以前的方法具有更高的精度。尤其是,基于ResNet和专家系统的RKRE融合方法平均正确率为86.95%,在疾病分类领域是一个很好的可行性研究。 (C)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号