首页> 美国卫生研究院文献>Nucleic Acids Research >Training-free measures based on algorithmic probability identify high nucleosome occupancy in DNA sequences
【2h】

Training-free measures based on algorithmic probability identify high nucleosome occupancy in DNA sequences

机译:基于算法概率的免训练措施可确定DNA序列中的核小体占用率高

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We introduce and study a set of training-free methods of an information-theoretic and algorithmic complexity nature that we apply to DNA sequences to identify their potential to identify nucleosomal binding sites. We test the measures on well-studied genomic sequences of different sizes drawn from different sources. The measures reveal the known versus predictive discrepancies and uncover their potential to pinpoint high and low nucleosome occupancy. We explore different possible signals within and beyond the nucleosome length and find that the complexity indices are informative of nucleosome occupancy. We found that, while it is clear that the gold standard Kaplan model is driven by GC content (by design) and by -mer training; for high occupancy, entropy and complexity-based scores are also informative and can complement the Kaplan model.
机译:我们介绍并研究了信息理论和算法复杂性性质的一套无需训练的方法,该方法适用于DNA序列以鉴定其鉴定核小体结合位点的潜力。我们对来自不同来源的不同大小的经过充分研究的基因组序列进行了测试。这些措施揭示了已知差异与预测差异,并揭示了它们确定高和低核小体占有率的潜力。我们探索了核小体长度之内和之外的不同可能信号,发现复杂性指数对核小体的占有率具有指导意义。我们发现,尽管很明显,金标准的Kaplan模型是由GC含量(通过设计)和-mer培训驱动的;对于高占用率,基于熵和复杂度的评分也很有用,可以补充Kaplan模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号