首页> 外文会议>Australian Conference on Progress in Artificial Life(ACAL 2007); 20071204-06; Gold Coast(AU) >Identification of Marker Genes Discriminating the Pathological Stages in Ovarian Carcinoma by Using Support Vector Machine and Systems Biology
【24h】

Identification of Marker Genes Discriminating the Pathological Stages in Ovarian Carcinoma by Using Support Vector Machine and Systems Biology

机译:利用支持向量机和系统生物学技术鉴定可区分卵巢癌病理分期的标记基因

获取原文
获取原文并翻译 | 示例

摘要

Ovarian cancer is a primary gynecological cancer which pathological stages include benign, borderline and invasive stages cause death in many countries. In this paper, linear regression, analysis of variance (ANOVA) and support vector machine (S VM) are used to identify the gene markers of ovarian cancer for an authentic cDNA expression datasets among 8 normal ovarian tumors, 6 borderline of cancers, 7 ovarian cancer at stage Ⅰ and 9 ovarian cancer at stage Ⅲ samples. First, the linear regression analysis obtains 200 useful genes with largest residuals. Further select 14 genes by ANOVA and Scheffe when P-value is less than 0.000005. Then, we use support vector machine to classify the pathological stages by gene expressions. Five experiments are performed with clustering conditions. In the first clustering experiment, the cluster 1 includes BOT, and other pathological stages are in cluster 2. They have significant differences at BOT stage and can get average accuracy about 95.686% in cross-validation. It is quite precise for classifying pathological stages by gene expressions. The average accuracy of all clustering experiments is about 88.541% in cross-validation. Besides, we also develop a statistical analysis system including linear regression and ANOVA function for gene expression analysis. The experimental results and our analysis system can assist biologists and doctors to research and diagnose ovarian cancer by gene expressions.
机译:卵巢癌是一种原发性妇科癌症,其病理阶段包括良性,边缘性和浸润性阶段,在许多国家引起死亡。本文使用线性回归,方差分析(ANOVA)和支持向量机(S VM)来鉴定卵巢癌的基因标记,以鉴定8个正常卵巢肿瘤,6个癌症边界线,7个卵巢癌中的真实cDNA表达数据集Ⅰ期癌和9期Ⅲ期卵巢癌。首先,线性回归分析获得了200个残基最大的有用基因。当P值小于0.000005时,通过ANOVA和Scheffe进一步选择14个基因。然后,我们使用支持向量机通过基因表达对病理阶段进行分类。在聚类条件下进行了五个实验。在第一个聚类实验中,聚类1包括BOT,其他病理阶段在聚类2中。它们在BOT阶段具有显着差异,并且在交叉验证中可以获得约95.686%的平均准确度。通过基因表达对病理阶段进行分类非常精确。在交叉验证中,所有聚类实验的平均准确度约为88.541%。此外,我们还开发了包括线性回归和ANOVA函数的基因表达分析的统计分析系统。实验结果和我们的分析系统可以帮助生物学家和医生通过基因表达来研究和诊断卵巢癌。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号