Published in: Bioinformatics

Identification of compositionally distinct regions in genomes using the centroid method



Abstract

Motivation: Most genomic regions of special interest, such as horizontally acquired sequences and genomic islands, are known to have distinct word (m-mer) compositions. Most earlier work in this direction addressed di- and tri-nucleotide compositions. We present a method, called the centroid approach, that can reveal compositionally distinct regions in genomic sequences for any given word size.

Results: We applied our method to 50 bacterial genomes and demonstrated its ability to identify embedded sequences of varying lengths from distantly related organisms. For four organisms from our dataset, we also investigated the genetic makeup of the regions that our method identified as compositionally distinct. Pathogenicity island (PAI) components and genes encoding strain-specific proteins are frequently found in these regions. The program is available on request from the authors.

Supplementary information: Supplementary data are available at Bioinformatics online.
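The abstract does not spell out how the centroid approach is computed, so the following is only an illustrative sketch of the general idea of m-mer composition analysis, not the authors' program. It assumes fixed-size sliding windows, takes the "centroid" to be the mean m-mer frequency vector over all windows, and flags windows whose Euclidean distance from that centroid is unusually large. The function names, the window and step sizes, and the `z_cutoff` threshold are all hypothetical choices made for this example.

```python
from collections import Counter
from itertools import product
import math


def mmer_frequencies(seq, m):
    """Relative frequency of every m-mer over the alphabet ACGT in seq.

    Words containing other characters (e.g. N) are ignored.
    """
    words = ["".join(p) for p in product("ACGT", repeat=m)]
    counts = Counter(seq[i:i + m] for i in range(len(seq) - m + 1))
    total = sum(counts[w] for w in words)
    return [counts[w] / total if total else 0.0 for w in words]


def distinct_windows(genome, m=2, window=1000, step=500, z_cutoff=2.0):
    """Return (start, end, distance) for windows far from the centroid.

    Assumption for illustration only: the centroid is the mean m-mer
    frequency vector of all windows, and "far" means more than
    z_cutoff standard deviations above the mean distance.
    """
    starts = list(range(0, len(genome) - window + 1, step))
    if not starts:
        return []
    vectors = [mmer_frequencies(genome[s:s + window], m) for s in starts]
    dim = len(vectors[0])
    centroid = [sum(v[k] for v in vectors) / len(vectors) for k in range(dim)]
    dists = [math.dist(v, centroid) for v in vectors]
    mean = sum(dists) / len(dists)
    sd = math.sqrt(sum((d - mean) ** 2 for d in dists) / len(dists))
    if sd == 0:
        return []  # every window has the same composition
    return [(s, s + window, d)
            for s, d in zip(starts, dists)
            if d > mean + z_cutoff * sd]


if __name__ == "__main__":
    # Toy genome: an AT-only host with a GC-only block embedded in the middle.
    host = "AT" * 5000
    toy = host[:5000] + "GC" * 500 + host[5000:]
    for start, end, dist in distinct_windows(toy, m=2):
        print(start, end, round(dist, 3))
```

Because `m` is a parameter, the same sketch works for any word size, although the frequency vector has 4^m entries and larger words need correspondingly longer windows to be estimated reliably.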
