首页> 外文期刊>Biomolecular engineering >Improving cluster-based missing value estimation of DNA microarray data.
【24h】

Improving cluster-based missing value estimation of DNA microarray data.

机译:改进基于簇的DNA微阵列数据缺失值估计。

获取原文
获取原文并翻译 | 示例
       

摘要

We present a modification of the weighted K-nearest neighbours imputation method (KNNimpute) for missing values (MVs) estimation in microarray data based on the reuse of estimated data. The method was called iterative KNN imputation (IKNNimpute) as the estimation is performed iteratively using the recently estimated values. The estimation efficiency of IKNNimpute was assessed under different conditions (data type, fraction and structure of missing data) by the normalized root mean squared error (NRMSE) and the correlation coefficients between estimated and true values, and compared with that of other cluster-based estimation methods (KNNimpute and sequential KNN). We further investigated the influence of imputation on the detection of differentially expressed genes using SAM by examining the differentially expressed genes that are lost after MV estimation. The performance measures give consistent results, indicating that the iterative procedure of IKNNimpute can enhance the prediction ability of cluster-based methods in the presence of high missing rates, in non-time series experiments and in data sets comprising both time series and non-time series data, because the information of the genes having MVs is used more efficiently and the iterative procedure allows refining the MV estimates. More importantly, IKNN has a smaller detrimental effect on the detection of differentially expressed genes.
机译:我们提出了一种基于加权估计数据重用的加权K最近邻插补方法(KNNimpute),用于微阵列数据中的缺失值(MV)估计。该方法称为迭代KNN插补(IKNNimpute),因为使用最近估算的值迭代执行估算。通过归一化均方根误差(NRMSE)和估计值与真实值之间的相关系数,在不同条件下(数据类型,缺失数据的结构和缺失数据)评估IKNNimpute的估计效率,并将其与其他基于聚类的估计值进行比较估计方法(KNNimpute和顺序KNN)。我们通过检查MV估计后丢失的差异表达基因,进一步调查了归因法对使用SAM检测差异表达基因的影响。这些性能指标给出了一致的结果,表明IKNNimpute的迭代过程可以提高在高丢失率,非时间序列实验以及包含时间序列和非时间的数据集中存在的基于聚类方法的预测能力系列数据,因为具有MV的基因信息得到更有效的利用,并且迭代过程可以完善MV估算值。更重要的是,IKNN对差异表达基因的检测具有较小的有害作用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号