首页> 外文期刊>Journal of Biosciences >In silico comparison of bacterial strains using mutual information
【24h】

In silico comparison of bacterial strains using mutual information

机译:使用互信息对细菌菌株进行计算机分析

获取原文
获取原文并翻译 | 示例
           

摘要

Fast-sequencing throughput methods have increased the number of completely sequenced bacterial genomes to about 400 by December 2006, with the number increasing rapidly. These include several strains. In silico methods of comparative genomics are of use in categorizing and phylogenetically sorting these bacteria. Various word-based tools have been used for quantifying the similarities and differences between entire genomes. The simple di-nucleotide frequency comparison, codon specificity and k-mer repeat detection are among some of the well-known methods. In this paper, we show that the Mutual Information function, which is a measure of correlations and a concept from Information Theory, is very effective in determining the similarities and differences among genome sequences of various strains of bacteria such as the plant pathogen Xylella fastidiosa, marine Cyanobacteria Prochlorococcus marinus or animal and human pathogens such as species of Ehrlichia and Legionella. The short-range three-base periodicity, small sequence repeats and long-range correlations taken together constitute a genome signature that can be used as a technique for identifying new bacterial strains with the help of strains already catalogued in the database. There have been several applications of using the Mutual Information function as a measure of correlations in genomics but this is the first whole genome analysis done to detect strain similarities and differences.
机译:到2006年12月,快速测序通量方法已将完全测序的细菌基因组的数量增加到约400个,并且数量迅速增加。这些包括几种菌株。比较基因组学的计算机方法可用于对这些细菌进行分类和系统分类。各种基于单词的工具已用于量化整个基因组之间的相似性和差异。简单的二核苷酸频率比较,密码子特异性和k-mer重复检测是一些众所周知的方法。在本文中,我们表明互信息功能是一种相关性度量,是信息理论的一个概念,它对于确定各种细菌菌株(如植物病原体Xylella fastidiosa)的基因组序列之间的相似性和差异非常有效。海洋蓝藻海水原球藻或动物和人类病原体,例如埃里希氏菌和军团菌。短程三碱基周期性,小序列重复和长程相关性共同构成了基因组签名,可以用作已在数据库中分类的菌株来鉴定新细菌菌株的技术。使用互信息功能作为基因组学中相关性的度量有多种应用,但这是第一个用于检测菌株相似性和差异的全基因组分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号