首页> 外文会议>International Conference on Information Technology and Applications in Biomedicine >A simple clustering approach for pathogenic strain identification based on local and global amino acid compositional signatures from genomic sequences: the Escherichia genus case
【24h】

A simple clustering approach for pathogenic strain identification based on local and global amino acid compositional signatures from genomic sequences: the Escherichia genus case

机译:基于基因组序列的局部和全球氨基酸组成特征的病原菌鉴定简单聚类方法:大肠杆菌壳

获取原文

摘要

Cluster analysis offers a suite of powerful unsupervised methods, commonly used as exploratory data analysis tools. Such tools can be proven especially useful when we face the situation of analyzing large data sets and want to get an intuitive insight at subtle correlations between instances of the data. In this work, we demonstrate that simple hierarchical clustering approaches (based on compositional features extracted from the amino acid sequences encoded in the complete genomic sequences of 25 species/strains belonging to the proteobacterial genus Escherichia) can be used to accurately discriminate between pathogenic and non-pathogenic strains of those bacteria.
机译:集群分析提供了一套强大的无监督方法,通常用作探索性数据分析工具。当我们面对分析大数据集的情况并希望在数据的情况之间进行直观相关性时,可以证明这些工具特别有用。在这项工作中,我们证明了简单的层次聚类方法(基于从属于植物属植物属植物属植物属植物属植物属植物的25种/菌株的完全基因组序列中提取的组成特征)可用于精确区分致病和非 - 那些细菌的菌株。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号