...
首页> 外文期刊>Knowledge and Information Systems >MotifMiner: Efficient discovery of common substructures in biochemical molecules
【24h】

MotifMiner: Efficient discovery of common substructures in biochemical molecules

机译:MotifMiner:生化分子中常见亚结构的有效发现

获取原文
获取原文并翻译 | 示例

摘要

Biochemical research often involves examining structural relationships in molecules since scientists strongly believe in the causal relationship between structure and function. Traditionally, researchers have identified these patterns, or motifs, manually using domain expertise. However, with the massive influx of new biochemical data and the ability to gather data for very large molecules, there is great need for techniques that automatically and efficiently identify commonly occurring structural patterns in molecules. Previous automated substructure discovery approaches have each introduced variations of similar underlying techniques and have embedded domain knowledge. While doing so improves performance for the particular domain, this complicates extensibility to other domains. Also, they do not address scalability or noise, which is critical for macromolecules such as proteins. In this paper, we present MotifMiner, a general framework for efficiently identifying common motifs in most scientific molecular datasets. The approach combines structure-based frequent-pattern discovery with search space reduction and coordinate noise handling. We describe both the framework and several algorithms as well as demonstrate the flexibility of our system by analyzing protein and drug biochemical datasets.
机译:由于科学家坚信结构与功能之间的因果关系,因此生化研究通常涉及检查分子中的结构关系。传统上,研究人员已使用领域专业知识手动识别了这些模式或图案。但是,随着大量新生化数据的涌入以及收集非常大分子的数据的能力,迫切需要能够自动,有效地识别分子中常见结构模式的技术。先前的自动子结构发现方法都引入了类似基础技术的变体,并且具有嵌入式领域知识。虽然这样做可以提高特定域的性能,但这会使对其他域的可扩展性复杂化。而且,它们也没有解决可伸缩性或噪声,这对于诸如蛋白质的大分子至关重要。在本文中,我们介绍了MotifMiner,这是一个有效识别大多数科学分子数据集中常见基序的通用框架。该方法将基于结构的频繁模式发现与搜索空间缩减和坐标噪声处理相结合。我们描述了框架和几种算法,并通过分析蛋白质和药物生化数据集证明了我们系统的灵活性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号