首页> 外文期刊>Genes >An Integrated Approach for Identifying Molecular Subtypes in Human Colon Cancer Using Gene Expression Data
【24h】

An Integrated Approach for Identifying Molecular Subtypes in Human Colon Cancer Using Gene Expression Data

机译:利用基因表达数据鉴定人结肠癌分子亚型的综合方法

获取原文
           

摘要

Identifying molecular subtypes of colorectal cancer (CRC) may allow for more rational, patient-specific treatment. Various studies have identified molecular subtypes for CRC using gene expression data, but they are inconsistent and further research is necessary. From a methodological point of view, a progressive approach is needed to identify molecular subtypes in human colon cancer using gene expression data. We propose an approach to identify the molecular subtypes of colon cancer that integrates denoising by the Bayesian robust principal component analysis (BRPCA) algorithm, hierarchical clustering by the directed bubble hierarchical tree (DBHT) algorithm, and feature gene selection by an improved differential evolution based feature selection method (DEFS W ) algorithm. In this approach, the normal samples being completely and exclusively clustered into one class is considered to be the standard of reasonable clustering subtypes, and the feature selection pays attention to imbalances of samples among subtypes. With this approach, we identified the molecular subtypes of colon cancer on the mRNA gene expression dataset of 153 colon cancer samples and 19 normal control samples of the Cancer Genome Atlas (TCGA) project. The colon cancer was clustered into 7 subtypes with 44 feature genes. Our approach could identify finer subtypes of colon cancer with fewer feature genes than the other two recent studies and exhibits a generic methodology that might be applied to identify the subtypes of other cancers.
机译:鉴定结直肠癌(CRC)的分子亚型可能允许进行更合理的,针对患者的治疗。各种研究已经使用基因表达数据确定了CRC的分子亚型,但是它们并不一致,需要进一步的研究。从方法学的观点来看,需要一种渐进的方法来使用基因表达数据鉴定人结肠癌中的分子亚型。我们提出了一种识别结肠癌分子亚型的方法,该方法整合了由贝叶斯鲁棒主成分分析(BRPCA)算法进行的去噪,由有向气泡层次树(DBHT)算法进行的层次聚类以及通过基于改进的差异进化的特征基因选择特征选择方法(DEFS W)算法。在这种方法中,将正常样本完全且仅聚类为一类被视为合理聚类子类型的标准,并且特征选择要注意子类型之间样本的不平衡性。通过这种方法,我们在癌症基因组图谱(TCGA)项目的153个结肠癌样品和19个正常对照样品的mRNA基因表达数据集上确定了结肠癌的分子亚型。结肠癌被聚类为具有44个特征基因的7个亚型。与其他两项最新研究相比,我们的方法可以识别特征基因较少的更精细的结肠癌亚型,并且展现出可用于识别其他癌症亚型的通用方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号