首页> 外文会议>Workshop on Genome Informatics >DNA microarray data analysis for cancer classification based on stepwise discriminant analysis and Bayesian decision theory
【24h】

DNA microarray data analysis for cancer classification based on stepwise discriminant analysis and Bayesian decision theory

机译:基于逐步判别分析和贝叶斯决策理论的癌症分类DNA微阵列数据分析

获取原文

摘要

DNA microarray are providing an unprecedented amount of information about the genetic changes. From the analysis of the data, we can get insights into various diseases such as cancer whose information is difficult to be obtained. Golub et al. showedthe possibility of cancer classification based on gene expression profiles. They clearly showed that the detection of wholesale changes in gene expression patterns is possible. However, researchers could not identify the genes whose activity is turned up or down. They also could not find which changes of genes are important for cancer development and progression-the causes and not just the effects. Therefore, the useful methods to analyze DNA rnicroarray data have not been proposed. In this paper, we find the causal relationship between several tumors and the gene-expression data by sequentially using the stepwise discriminant analysis method (SDA) and Bayesian decision theory (BDT). Eighty-five samples containing four tumor classes are used in this study. The classes are neuroblastoma (NB), rhabdomyosarcoma (RMS), non-Hodgkin lymphoma (BL) and the Ewing family of tumor (EWS). These data are open on world wide web. SDA is used to select critical genes for accurate classification of 4 tumors from original 2308 genes. With the selected genes, Bayesian classifier is made, which minimizes the misclassification rate. As a result, the classification performance increases to 100 percent, and 9 new genes that have relation with the development of the tumorsis found additionally.
机译:DNA微阵列正在提供有关遗传变化的前所未有的信息。根据数据的分析,我们可以对诸如难以获得的癌症等各种疾病的见解。 golub等人。展示了基于基因表达谱的癌症分类的可能性。他们清楚地表明,可以进行基因表达模式的批发变化。但是,研究人员无法识别活动被上下或下降的基因。他们也找不到哪种基因的变化对于癌症发展和进展是重要的 - 原因,而不仅仅是效果。因此,尚未提出分析DNA Rnicroarray数据的有用方法。在本文中,我们通过逐步判别分析方法(SDA)和贝叶斯决策理论(BDT)顺序地发现了几种肿瘤与基因表达数据之间的因果关系。本研究使用含有四种肿瘤课程的八十五个样品。该类是神经母细胞瘤(NB),横纹肌肉瘤(RMS),非霍奇金淋巴瘤(BL)和肿瘤的胚胎(EWS)。这些数据在万维网上开放。 SDA用于选择来自原始2308基因的4个肿瘤的准确分类的关键基因。通过所选基因,制造贝叶斯分类,这使得最小化错误分类率。结果,分类性能增加到100%,9个新的基因与肿瘤的发展另外发现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号