首页> 外文期刊>Bioinformatics >Multidimensional support vector machines for visualization of gene expression data
【24h】

Multidimensional support vector machines for visualization of gene expression data

机译:多维支持向量机,用于可视化基因表达数据

获取原文
获取原文并翻译 | 示例
       

摘要

MOTIVATION: Since DNA microarray experiments provide us with huge amount of gene expression data, they should be analyzed with statistical methods to extract the meanings of experimental results. Some dimensionality reduction methods such as Principal Component Analysis (PCA) are used to roughly visualize the distribution of high dimensional gene expression data. However, in the case of binary classification of gene expression data, PCA does not utilize class information when choosing axes. Thus clearly separable data in the original space may not be so in the reduced space used in PCA. RESULTS: For visualization and class prediction of gene expression data, we have developed a new SVM-based method called multidimensional SVMs, that generate multiple orthogonal axes. This method projects high dimensional data into lower dimensional space to exhibit properties of the data clearly and to visualize a distribution of the data roughly. Furthermore, the multiple axes can be used for class prediction. The basic properties of conventional SVMs are retained in our method: solutions of mathematical programming are sparse, and nonlinear classification is implemented implicitly through the use of kernel functions. The application of our method to the experimentally obtained gene expression datasets for patients' samples indicates that our algorithm is efficient and useful for visualization and class prediction. CONTACT: komura@hal.rcast.u-tokyo.ac.jp.
机译:动机:由于DNA微阵列实验为我们提供了大量的基因表达数据,因此应使用统计方法对其进行分析,以提取实验结果的含义。一些降维方法(例如主成分分析(PCA))用于大致可视化高维基因表达数据的分布。但是,对于基因表达数据的二进制分类,选择轴时PCA不利用类别信息。因此,在原始空间中明显可分离的数据在PCA中使用的缩小空间中可能不是这样。结果:为了进行基因表达数据的可视化和类预测,我们开发了一种新的基于SVM的方法,称为多维SVM,可生成多个正交轴。此方法将高维数据投影到低维空间中,以清晰显示数据的属性并大致可视化数据的分布。此外,多个轴可以用于类别预测。我们的方法保留了常规SVM的基本属性:数学编程的解决方案很少,而非线性分类是通过使用内核函数隐式实现的。我们的方法在通过实验获得的患者样本基因表达数据集上的应用表明,我们的算法对于可视化和类别预测是有效且有用的。联系人:komura@hal.rcast.u-tokyo.ac.jp。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号