首页> 美国卫生研究院文献>Nucleic Acids Research >Comprehensive discovery of DNA motifs in 349 human cells and tissues reveals new features of motifs
【2h】

Comprehensive discovery of DNA motifs in 349 human cells and tissues reveals new features of motifs

机译:在349种人类细胞和组织中全面发现DNA图案揭示了这些图案的新特征

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Comprehensive motif discovery under experimental conditions is critical for the global understanding of gene regulation. To generate a nearly complete list of human DNA motifs under given conditions, we employed a novel approach to de novo discover significant co-occurring DNA motifs in 349 human DNase I hypersensitive site datasets. We predicted 845 to 1325 motifs in each dataset, for a total of 2684 non-redundant motifs. These 2684 motifs contained 54.02 to 75.95% of the known motifs in seven large collections including TRANSFAC. In each dataset, we also discovered 43 663 to 2 013 288 motif modules, groups of motifs with their binding sites co-occurring in a significant number of short DNA regions. Compared with known interacting transcription factors in eight resources, the predicted motif modules on average included 84.23% of known interacting motifs. We further showed new features of the predicted motifs, such as motifs enriched in proximal regions rarely overlapped with motifs enriched in distal regions, motifs enriched in 5′ distal regions were often enriched in 3′ distal regions, etc. Finally, we observed that the 2684 predicted motifs classified the cell or tissue types of the datasets with an accuracy of 81.29%. The resources generated in this study are available at .
机译:在实验条件下全面的基序发现对于全球了解基因调控至关重要。为了在给定条件下生成几乎完整的人类DNA主题列表,我们采用了一种新颖的方法从头发现349个人类DNase I超敏位点数据集中的重要共现DNA主题。我们预测了每个数据集中的845至1325个图案,共2684个非冗余图案。在包括TRANSFAC在内的七个大型馆藏中,这2684个主题包含已知主题的54.02%至75.95%。在每个数据集中,我们还发现了43663至2013288个基序模块,这些基序组及其结合位点同时出现在大量的短DNA区域中。与八种资源中已知的相互作用转录因子相比,预测的基序模块平均包含84.23%的已知相互作用基序。我们进一步展示了预测的基序的新特征,例如富集在近端区域的基序很少与富集在远端区域的基序重叠,富集在5'远端区域的基序经常富集在3'远端区域等,最后,我们观察到2684个预测的基序对数据集的细胞或组织类型进行了分类,准确性为81.29%。这项研究产生的资源可在上找到。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号