首页> 外文期刊>Bioinformatics >Mining significant tree patterns in carbohydrate sugar chains.
【24h】

Mining significant tree patterns in carbohydrate sugar chains.

机译:在碳水化合物糖链中挖掘重要的树型。

获取原文
获取原文并翻译 | 示例
           

摘要

MOTIVATION: Carbohydrate sugar chains or glycans, the third major class of macromolecules, hold branch shaped tree structures. Glycan motifs are known to be two types: (1) conserved patterns called 'cores' containing the root and (2) ubiquitous motifs which appear in external parts including leaves and are distributed over different glycan classes. Finding these glycan tree motifs is an important issue, but there have been no computational methods to capture these motifs efficiently. RESULTS: We have developed an efficient method for mining motifs or significant subtrees from glycans. The key contribution of this method is: (1) to have proposed a new concept, 'a-closed frequent subtrees', and an efficient method for mining all these subtrees from given trees and (2) to have proposed to apply statistical hypothesis testing to rerank the frequent subtrees in significance. We experimentally verified the effectiveness of the proposed method using real glycans: (1)We examined the top 10 subtrees obtained by our method at some parameter setting and confirmed that all subtrees are significant motifs in glycobiology. (2) We applied the results of our method to a classification problem and found that our method outperformed other competing methods, SVM with three different tree kernels, being all statistically significant. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
机译:动机:碳水化合物的糖链或聚糖是大分子的第三大类,具有树枝状的树状结构。已知的聚糖基序有两种类型:(1)包含根的称为“核心”的保守模式;(2)普遍存在的基序出现在包括叶在内的外部部分,并分布在不同的聚糖类别上。找到这些聚糖树图案是一个重要的问题,但是还没有计算方法可以有效地捕获这些图案。结果:我们已经开发出一种有效的方法来从聚糖中挖掘基序或重要的亚树。该方法的主要贡献是:(1)提出了一个新概念“ a-closed频繁子树”,以及一种从给定树中挖掘所有这些子树的有效方法,以及(2)提出了应用统计假设检验的方法重新排列频繁出现的子树的意义。我们通过实验验证了使用真实聚糖的方法的有效性:(1)我们检查了通过我们的方法在某些参数设置下获得的前10个亚树,并确认所有亚树都是糖生物学中的重要基序。 (2)我们将我们的方法的结果应用于分类问题,发现我们的方法优于具有其他竞争性方法的SVM(具有三个不同的树核),在统计上都是显着的。补充信息:补充数据可从Bioinformatics在线获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号