首页> 外文会议>Asia-Pacific Bioinformatics Conference >Missinq value imputation for microRNAexpression data by using a GO-based similarity measure
【24h】

Missinq value imputation for microRNAexpression data by using a GO-based similarity measure

机译:使用基于Go的相似度测量,MissInq对MicroRNAExpression数据的价值估算

获取原文

摘要

Background: Missing values are commonly present in microarray data profiles. Instead of discarding genes or samples with incomplete expression level, missing values need to be properly imputed for accurate data analysis. The imputation methods can beroughly categorized as expression level-based and domain knowledge-based. The first type of methods only rely on expression data without the help of external data sources, while the second type incorporates available domain knowledge into expression datato improve imputation accuracy. In recent years, microRNA (miRNA) microarray has been largely developed and used for identifying miRNA biomarkers in complex human disease studies. Similar to mRNA profiles, miRNA expression profiles with missing values can be treated with the existing imputation methods. However, the domain knowledge-based methods are hard to be applied due to the lack of direct functional annotation for miRNAs. With the rapid accumulation of miRNA microarray data, it is increasingly needed to develop domain knowledge-based imputation algorithms specific to miRNA expression profiles to improve the quality of miRNA data analysis.Results: We connect miRNAs with domain knowledge of Gene Ontology (GO) via their target genes, and define miRNA functional similarity based on the semantic similarity of GO terms in GO graphs. A new measure combining miRNA functional similarity and expression similarity is used in the imputation of missing values. The new measure is tested on two miRNA microarray datasets from breast cancer research and achieves improved performance compared with the expression-based method on both datasets.Conclusions: The experimental results demonstrate that the biological domain knowledge can benefit the estimation of missing values in miRNA profiles as well as mRNA profiles. Especially, functional similarity defined by GO terms annotated for the targetgenes of miRNAs can be useful complementary information for the expression-based method to improve the imputation accuracy of miRNA array data. Our method and data are available to the public upon request.
机译:背景:缺失值通常存在于微阵列数据配置文件中。无需丢弃具有不完全表达水平的基因或样品,因此需要正确地抵消缺失的值以进行准确的数据分析。估算方法可以分为基于级别的基于域的表达和域知识。第一种类型的方法仅依赖于表达数据而无需外部数据源,而第二种类型将可用的域知识包含成表达式数据,提高了估算准确性。近年来,MicroRNA(miRNA)微阵列已经大大开发并用于鉴定MiRNA生物标志物在复杂的人类疾病研究中。类似于mRNA配置文件,可以用现有的归属方法处理具有缺失值的miRNA表达谱。然而,由于MiRNA缺乏直接功能注释,难以应用基于域知识的方法。随着MiRNA微阵列数据的快速积累,越来越需要开发特定于MiRNA表达型谱的基于域知识的贬低算法,以提高miRNA数据分析的质量。结果:我们将MiRNA与基因本体(GO)的域名知识连接它们的目标基因,并根据GO图表中的GO术语的语义相似性定义miRNA功能相似性。组合MiRNA功能相似性和表达式相似性的新度量用于缺失值的归属。新措施在两种miRNA微阵列数据集中测试了来自乳腺癌研究的两种miRNA微阵列数据集,与两个数据集中的基于表达式的方法相比,实现了改进的性能。结论:实验结果表明,生物域知识可以使MiRNA简介中缺失值估计以及mRNA配置文件。特别地,由对miRNA的目标针对的GO术语定义的功能相似度可以是用于基于表达式的方法的有用的互补信息,以提高MiRNA阵列数据的归纳精度。我们的方法和数据可根据要求提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号