首页> 外文会议>19th international conference on software engineering and data engineering 2010 >A Novel Combinatorial Score for Feature Selection with P-Tree in DNA Microarray Data Analysis
【24h】

A Novel Combinatorial Score for Feature Selection with P-Tree in DNA Microarray Data Analysis

机译:DNA芯片数据分析中带有P树的特征选择的新型组合评分

获取原文
获取原文并翻译 | 示例

摘要

DNA microarray experiments are being used to gather information from tissue and cell samples by generating thousands of gene expression measurements. Many researchers are conducting researches regarding gene expression differences, which is useful in disease diagnose, outcome prediction, cancer type classification and etc. In mining high-dimensional microarray data, feature selection is an important pre-processing stage. In the literature nearly all existing supervised feature selection methods use class labels as supervision information. In this paper, we propose a novel score using the label correlation in combination with the correlation between features. We design a Combinatorial Score feature selection algorithm in P-Tree structure and combine it with K-Nearest-Neighbor algorithm for breast cancer clinic metastasis time prediction. Our experiments suggest that our Combinatorial Score feature selection algorithm can find a subset of genes with high computation efficiency and significant performance for breast cancer clinical metastasis prediction.
机译:通过产生数千个基因表达测量结果,DNA微阵列实验被用于从组织和细胞样品中收集信息。许多研究人员正在进行有关基因表达差异的研究,这些差异可用于疾病诊断,结果预测,癌症类型分类等。在挖掘高维微阵列数据时,特征选择是重要的预处理阶段。在文献中,几乎所有现有的受监督特征选择方法都使用类标签作为监督信息。在本文中,我们提出了一种使用标签相关性与特征之间相关性相结合的新分数。我们设计了一种P树结构的组合评分特征选择算法,并将其与K最近邻算法相结合,用于乳腺癌临床转移时间的预测。我们的实验表明,我们的组合得分特征选择算法可以找到具有较高计算效率并在乳腺癌临床转移预测中具有显着性能的基因子集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号