首页> 外文期刊>Cancer Informatics >Extending Information Retrieval Methods to Personalized Genomic-Based Studies of Disease
【24h】

Extending Information Retrieval Methods to Personalized Genomic-Based Studies of Disease

机译:将信息检索方法扩展到基于基因组的个性化疾病研究

获取原文
           

摘要

Genomic-based studies of disease now involve diverse types of data collected on large groups of patients. A major challenge facing statistical scientists is how best to combine the data, extract important features, and comprehensively characterize the ways in which they affect an individual’s disease course and likelihood of response to treatment. We have developed a survival-supervised latent Dirichlet allocation (survLDA) modeling framework to address these challenges. Latent Dirichlet allocation (LDA) models have proven extremely effective at identifying themes common across large collections of text, but applications to genomics have been limited. Our framework extends LDA to the genome by considering each patient as a “document” with “text” detailing his/her clinical events and genomic state. We then further extend the framework to allow for supervision by a time-to-event response. The model enables the efficient identification of collections of clinical and genomic features that co-occur within patient subgroups, and then characterizes each patient by those features. An application of survLDA to The Cancer Genome Atlas ovarian project identifies informative patient subgroups showing differential response to treatment, and validation in an independent cohort demonstrates the potential for patient-specific inference.
机译:现在,基于基因组的疾病研究涉及从大量患者中收集的各种类型的数据。统计科学家面临的主要挑战是如何最好地组合数据,提取重要特征并全面表征它们影响个体疾病进程和对治疗反应的可能性的方式。我们已经开发了生存监督的潜在狄利克雷分配(survLDA)建模框架来应对这些挑战。潜在的狄利克雷分配(LDA)模型已被证明在识别大量文本集合中常见的主题方面极为有效,但是在基因组学中的应用受到限制。我们的框架通过将每位患者视为“文档”,并详细说明其临床事件和基因组状态的“文本”,将LDA扩展到基因组。然后,我们进一步扩展框架,以允许通过事件响应时间进行监督。该模型能够有效地识别患者亚组内同时出现的临床和基因组特征的集合,然后通过这些特征来表征每个患者。 survLDA在“癌症基因组图集”卵巢癌项目中的应用可识别信息丰富的患者亚组,这些亚组显示出对治疗的不同反应,并且在独立队列中的验证证明了进行患者特异性推断的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号