首页> 外文会议>IEEE International Symposium on Bioinformatics and Bioengineering >HAMMER Algorithm: Hashing with Arithmetic Modulo-4 for Motif Extraction of Regulatory Elements
【24h】

HAMMER Algorithm: Hashing with Arithmetic Modulo-4 for Motif Extraction of Regulatory Elements

机译:锤击算法:用算术模式-4散列用于监管元素的基序提取

获取原文

摘要

A new algorithm, HAMMER, discovers cis-elements in promoter regions of the co-regulated genes. We show that HAMMER is faster and more accurate than well-known tools currently in use to identify cis-elements. Given input sequences that represent promoter regions of genes, this algorithm searches for subsequences of desired length w whose frequency of occurrence is relatively high, while accounting for slightly corrupted variants (with up to d substitutions). Various w-mers are numerically encoded and represented in a hash table, and d-neighbors are efficiently discovered using a modulo-4 arithmetic operation. Profile matrices are constructed and evaluated using a high-order Markov model based on background data (from a gene database). HAMMER discovers the most frequently occurring w-mers (permitting corruption in at most d positions). Experiment results show that HAMMER is significantly faster and discovers more motifs present in the test sequences, when compared with two well-known motif-discovery tools (MDScan and AlignACE).
机译:一种新的算法,锤子,发现共调节基因的启动子区域中的顺式元素。我们表明锤子比目前用于识别顺式元素的知名工具更快,更准确地更准确。给定代表基因的启动子区域的输入序列,该算法搜索所需长度的延续的频率相对较高,同时占略损坏的变体(最多d级)。各种W-MERS在数值上编码并在散列表中表示,并且使用模4算术运算有效地发现D邻居。通过基于背景数据(来自基因数据库)来构造和评估轮廓矩阵并使用高阶Markov模型进行评估。锤子发现最常见的W-MERS(允许最多D位置腐败)。实验结果表明,与两个公知的图案发现工具(MDSCAN和对准)相比,锤子显着更快,并发现测试序列中存在的更多主题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号