首页> 外文期刊>International journal of bioinformatics research and applications >A novel distance-based iterative sequential KNN algorithm for estimation of missing values in microarray gene expression data
【24h】

A novel distance-based iterative sequential KNN algorithm for estimation of missing values in microarray gene expression data

机译:一种新型距离基迭代顺序kNN算法,用于估计微阵列基因表达数据中的缺失值

获取原文
获取原文并翻译 | 示例
           

摘要

The presence of missing entries in DNA microarray gene expression datasets creates severe problems in downstream analysis because they require complete datasets. Though several missing value prediction methods have been proposed to solve this problem, they have limitations which may affect the performance of various analysis algorithms. In this regard, a novel distance based iterative sequential K-nearest neighbour imputation method (IS/GSTNimpute) has been proposed. The proposed distance is a hybridisation of modified Euclidean distance and Pearson correlation coefficient. The proposed method is a modification of KNN estimation in which the concept of reuse of estimation is considered using both iterative and sequential approach. The performance of the proposed ISAMNimpute method is tested on various time-series and non time-series microarray datasets comparing with several widely used existing imputation techniques. The experimental results confirm that the ISKNNimpute method consistently generates better results compared to other existing methods.
机译:DNA微阵列基因表达数据集中缺失条目的存在在下游分析中产生严重的问题,因为它们需要完整的数据集。虽然已经提出了几种缺失的值预测方法来解决这个问题,但它们具有限制可能影响各种分析算法的性能。在这方面,已经提出了一种新的基于距离的迭代顺序k最近邻居归属方法(IS / gstnimpute)。所提出的距离是改进的欧几里德距离和Pearson相关系数的杂交。所提出的方法是考虑使用迭代和顺序方法考虑估计的重用概念的kNN估计的修改。建议的ISAMnimmimpute方法的性能在各种时间序列和非时间序列微阵列数据集上进行测试,与若干广泛使用的现有归象技术相比。实验结果证实,与其他现有方法相比,Isknnimpute方法一直产生更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号