首页> 美国卫生研究院文献>other >Mucopolysaccharidosis type II detection by Naïve Bayes Classifier: An example of patient classification for a rare disease using electronic medical records from the Canadian Primary Care Sentinel Surveillance Network
【2h】

Mucopolysaccharidosis type II detection by Naïve Bayes Classifier: An example of patient classification for a rare disease using electronic medical records from the Canadian Primary Care Sentinel Surveillance Network

机译:朴素贝叶斯分类器检测II型粘多糖贮积病:使用加拿大初级保健前哨监视网络的电子病历对罕见疾病进行患者分类的示例

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Identifying patients with rare diseases associated with common symptoms is challenging. Hunter syndrome, or Mucopolysaccharidosis type II is a progressive rare disease caused by a deficiency in the activity of the lysosomal enzyme, iduronate 2-sulphatase. It is inherited in an X-linked manner resulting in males being significantly affected. Expression in females varies with the majority being unaffected although symptoms may emerge over time. We developed a Naïve Bayes classification (NBC) algorithm utilizing the clinical diagnosis and symptoms of patients contained within their de-identified and unstructured electronic medical records (EMR) extracted by the Canadian Primary Care Sentinel Surveillance Network (CPCSSN). To do so, we created a training dataset using published results in the scientific literature and from all MPS II symptoms and applied the training dataset and its independent features to compute the conditional posterior probabilities of having MPS II disease as a categorical dependent variable for 506497 male patients. The classifier identified 125 patients with the highest likelihood for having the disease and 18 features were selected to be necessary for forecasting. Next, a Recursive Backward Feature Elimination algorithm was employed, for optimal input features of the NBC model, using a k-fold Cross-Validation with 3 replicates. The accuracy of the final model was estimated by the Validation Set Approach technique and the bootstrap resampling. We also investigated that whether the NBC is as accurate as three other Bayesian networks. The Naïve Bayes Classifier appears to be an efficient algorithm in assisting physicians with the diagnosis of Hunter syndrome allowing optimal patient management.
机译:确定具有常见症状的罕见疾病的患者具有挑战性。亨特综合症或II型粘多糖贮积病是一种进行性稀有疾病,由溶酶体酶(二聚异氰酸酯)的活性不足引起。它以X连锁方式遗传,导致雄性受到严重影响。尽管症状可能会随着时间的流逝而出现,但女性的表达却有所不同,其中大多数未受影响。我们利用加拿大初级保健前哨监视网络(CPCSSN)提取的身份不明和非结构化电子病历(EMR)中包含的患者的临床诊断和症状,开发了朴素贝叶斯分类(NBC)算法。为此,我们使用科学文献中发表的结果并根据所有MPS II症状创建了一个训练数据集,并应用该训练数据集及其独立功能来计算MPS II疾病作为506497个男性的绝对因变量的条件后验概率耐心。分类器确定了125例患此病可能性最高的患者,并选择了18项特征进行预测。接下来,对于3个副本,使用k倍交叉验证,对NBC模型的最佳输入特征采用了递归后向特征消除算法。最终模型的准确性通过“验证集方法”技术和自举重采样进行了估计。我们还调查了NBC是否与其他三个贝叶斯网络一样准确。朴素的贝叶斯分类器似乎是一种有效的算法,可以帮助医生诊断亨特综合症,从而实现最佳的患者管理。

著录项

相似文献

  • 外文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号