【24h】

Discovering Significant Structures in Clustered Bio-molecular Data Through the Bernstein Inequality

机译:通过伯恩斯坦不等式发现聚类生物分子数据中的重要结构

获取原文

摘要

Searching for structures in complex bio-molecular data is a central issue in several branches of bioinformatics. In particular, the reliability of clusters discovered by a given clustering algorithm have been recently assessed through methods based on the concept of stability with respect to random perturbations of the data. In this context, a major problem is to assess the confidence of the measures of reliability. We discuss a partially "distribution independent" method based on the classical Bernstein inequality to assess the statistical significance of the discovered clusterings. Experimental results with gene expression data show the effectiveness of the proposed approach.
机译:在复杂的生物分子数据中寻找结构是生物信息学的几个分支中的核心问题。特别地,最近通过基于关于数据的随机扰动的稳定性的概念来评估由给定聚类算法发现的簇的可靠性。在这种情况下,主要问题是评估可靠性措施的置信度。我们讨论了基于古典伯尔斯坦不等式的部分“分布独立”方法,以评估发现的集群的统计学意义。基因表达数据的实验结果表明了所提出的方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号