首页> 美国卫生研究院文献>Nucleic Acids Research >Mining Bacillus subtilis chromosome heterogeneities using hidden Markov models
【2h】

Mining Bacillus subtilis chromosome heterogeneities using hidden Markov models

机译:使用隐马尔可夫模型挖掘枯草芽孢杆菌染色体异质性

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We present here the use of a new statistical segmentation method on the Bacillus subtilis chromosome sequence. Maximum likelihood parameter estimation of a hidden Markov model, based on the expectation-maximization algorithm, enables one to segment the DNA sequence according to its local composition. This approach is not based on sliding windows; it enables different compositional classes to be separated without prior knowledge of their content, size and localization. We compared these compositional classes, obtained from the sequence, with the annotated DNA physical map, sequence homologies and repeat regions. The first heterogeneity revealed discriminates between the two coding strands and the non-coding regions. Other main heterogeneities arise; some are related to horizontal gene transfer, some to t-enriched composition of hydrophobic protein coding strands, and others to the codon usage fitness of highly expressed genes. Concerning potential and established gene transfers, we found 9 of the 10 known prophages, plus 14 new regions of atypical composition. Some of them are surrounded by repeats, most of their genes have unknown function or possess homology to genes involved in secondary catabolism, metal and antibiotic resistance. Surprisingly, we notice that all of these detected regions are a + t-richer than the host genome, raising the question of their remote sources.
机译:我们在此介绍枯草芽孢杆菌染色体序列上新的统计分割方法的使用。基于期望最大化算法,隐马尔可夫模型的最大似然参数估计使人们能够根据其局部组成来分割DNA序列。这种方法不是基于滑动窗口的;它使不同的合成类可以分离,而无需事先知道它们的内容,大小和位置。我们将这些从序列中获得的组成类别与带注释的DNA物理图谱,序列同源性和重复区域进行了比较。第一异质性揭示了在两个编码链和非编码区之间的区别。其他主要的异质性出现了。一些与水平基因转移有关,一些与疏水蛋白编码链的t-富集组成有关,另一些与高表达基因的密码子使用适应性有关。关于潜在的和确定的基因转移,我们发现了10个已知的噬菌体中的9个,以及14个非典型组成的新区域。它们中的一些被重复序列包围,它们的大多数基因功能未知或与涉及继发性分解代谢,金属和抗生素抗性的基因具有同源性。令人惊讶的是,我们注意到所有这些检测到的区域都比宿主基因组富+ t,这提出了其遥远来源的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号