...
首页> 外文期刊>Disease markers >A semiautomated framework for integrating expert knowledge into disease marker identification
【24h】

A semiautomated framework for integrating expert knowledge into disease marker identification

机译:将专家知识整合到疾病标记识别中的半自动化框架

获取原文
获取原文并翻译 | 示例
           

摘要

Background. The availability of large complex data sets generated by high throughput technologies has enabled the recent proliferation of disease biomarker studies. However, a recurring problem in deriving biological information from large data sets is how to best incorporate expert knowledge into the biomarker selection process. Objective. To develop a generalizable framework that can incorporate expert knowledge into data-driven processes in a semiautomatedwaywhile providing a metric for optimization in a biomarker selection scheme. Methods. The framework was implemented as a pipeline consisting of five components for the identification of signatures from integrated clustering (ISIC). Expert knowledge was integrated into the biomarker identification process using the combination of two distinct approaches; a distance-based clustering approach and an expert knowledge-driven functional selection. Results. The utility of the developed framework ISIC was demonstrated on proteomics data from a study of chronic obstructive pulmonary disease (COPD). Biomarker candidates were identified in a mouse model using ISIC and validated in a study of a human cohort. Conclusions. Expert knowledge can be introduced into a biomarker discovery process in different ways to enhance the robustness of selected marker candidates. Developing strategies for extracting orthogonal and robust features from large data sets increases the chances of success in biomarker identification.
机译:背景。由高通量技术生成的大型复杂数据集的可用性使疾病生物标记物研究的近期扩散成为可能。然而,从大数据集获取生物信息的一个反复出现的问题是如何将专家知识最佳地整合到生物标志物选择过程中。目的。开发一个可通用的框架,该框架可将专家知识以半自动化的方式整合到数据驱动的流程中,同时为生物标记选择方案的优化提供度量。方法。该框架被实施为包含五个组件的管道,用于从集成群集(ISIC)识别签名。通过两种不同方法的结合,将专家知识整合到生物标志物识别过程中。基于距离的聚类方法和专家知识驱动的功能选择。结果。慢性阻塞性肺疾病(COPD)研究的蛋白质组学数据证明了已开发框架ISIC的实用性。使用ISIC在小鼠模型中鉴定了候选生物标志物,并在一项人类队列研究中对其进行了验证。结论。可以以不同方式将专家知识引入生物标记物发现过程,以增强所选标记物候选物的鲁棒性。开发用于从大型数据集中提取正交特征和鲁棒特征的策略,增加了生物标记识别成功的机会。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号