首页> 外文会议>International Joint Conference on Artificial Intelligence >Bayesian Nonparametric Collaborative Topic Poisson Factorization for Electronic Health Records-Based Phenotyping
【24h】

Bayesian Nonparametric Collaborative Topic Poisson Factorization for Electronic Health Records-Based Phenotyping

机译:贝叶斯非参数协同主题泊松归类为基于电子健康记录的表型

获取原文

摘要

Phenotyping with electronic health records (EHR) has received much attention in recent years because the phenotyping opens a new way to discover clinically meaningful insights, such as disease progression and disease subtypes without human supervisions. In spite of its potential benefits, the complex nature of EHR often requires more sophisticated methodologies compared with traditional methods. Previous works on EHR-based phenotyping utilized unsupervised and supervised learning methods separately by independently detecting phenotypes and predicting medical risk scores. To improve EHR-based phenotyping by bridging the separated methods, we present Bayesian non-parametric collaborative topic Poisson factorization (BN-CTPF) that is the first nonparametric content-based Poisson factorization and first application of jointly analyzing the phenotye topics and estimating the individual risk scores. BN-CTPF shows better performances in predicting the risk scores when we compared the model with previous matrix factorization and topic modeling methods including a Poisson factorization and its collaborative extensions. Also, BN-CTPF provides faceted views on the phenotype topics by patients' demographics. Finally, we demonstrate a scalable stochastic variational inference algorithm by applying BN-CTPF to a national-scale EHR dataset.
机译:电子健康记录(EHR)的表型近年来受到了很多关注,因为该表型开辟了一种发现临床有意义的见解的新方法,例如没有人为监督的疾病进展和疾病亚型。尽管其潜在的好处,与传统方法相比,EHR的复杂性通常需要更复杂的方法。以前通过独立检测表型和预测医疗风险分数来分别利用无监督和监督学习方法的基于EHR的表型。为了通过弥合分离的方法来提高基于EHR的表型,我们呈现贝叶斯非参数协作主题泊松分解(BN-CTPF),这是第一个基于非参数含量的泊松分子,并首先应用联合分析了比特主题和估计个体风险分数。 BN-CTPF在将模型与先前的矩阵分解和主题建模方法进行比较时,BN-CTPF在预测风险分数方面表现出更好的表现,包括泊松分解及其协作扩展。此外,BN-CTPF在患者人口统计学的表型主题上提供了面对面的视图。最后,我们通过将BN-CTPF应用于全国级EHR数据集来展示可扩展的随机变分推理算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号