首页> 外文会议>International Conference on Advances in Computing, Communications and Informatics >An improved fuzzy based approach to impute missing values in DNA microarray gene expression data with collaborative filtering
【24h】

An improved fuzzy based approach to impute missing values in DNA microarray gene expression data with collaborative filtering

机译:一种改进的基于模糊的基于模糊的方法,以通过协同过滤施加DNA微阵列基因表达数据中缺失值

获取原文

摘要

DNA microarray experiments normally generate gene expression profiles in the form of high dimensional matrices. It may happen that DNA microarray gene expression values contain many missing values within its data due to several reasons like image disruption, hybridization error, dust, moderate resolution etc. It will be very unfortunate if these missing values affect the performance of subsequent statistical and machine learning experiments significantly. There exist various missing value estimation algorithms. In this work we have proposed a modification to the existing imputation approach named as Collaborative Filtering Based on Rough-Set Theory (CFBRST) [10]. This proposed approach (CFBRSTFDV) uses Fuzzy Difference Vector (FDV) along with Rough Set based Collaborative Filtering that analyzes historical interactions and helps to estimate the missing values. This is a suggestion based system that works on the principle of how suggestion of items or products arrive to an individual while using FB, Twitter or looking for books in Amazon. We have applied our proposed algorithm on two benchmark dataset SPELLMAN & Tumor Cell (GDS2932) and the experiments show that the modified approach, CFBRSTFDV, outperforms the other existing state-of-the art methods as far as RMSE measures are concerned, particularly when we increase the number of missing values.
机译:DNA微阵列实验通常以高尺寸基质形式产生基因表达谱。由于图像中断,杂交误差,灰尘,中度分辨率等的几个原因,DNA微阵列基因表达值在其数据中包含许多缺失值。如果这些缺失值会影响随后的统计和机器的性能,那将是非常不幸的学习实验显着。存在各种缺失值估计算法。在这项工作中,我们提出了基于粗糙设置理论(CFBRST)的现有估算方法的修改,以基于粗糙的理论(CFBRST)[10]。这一提出的方法(CFBRSTFDV)使用模糊差载体(FDV)以及基于粗糙集的协作滤波,分析了历史交互,并有助于估计缺失的值。这是一个基于建议的系统,其原则是如何在使用FB,Twitter或寻找亚马逊书籍时如何到达个人的原则。我们在两个基准数据集法术师和肿瘤细胞(GDS2932)上应用了我们提出的算法(GDS2932),实验表明,目前修改方法CFBRSTFDV,表明,就RMSE措施而言,其他现有最先进的方法尤其如此增加缺失值的数量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号