首页> 外文会议>Chinese Control Conference >Unsupervised binning of metagenomic datasets using cluster size insensitive fuzzy c-means method
【24h】

Unsupervised binning of metagenomic datasets using cluster size insensitive fuzzy c-means method

机译:使用簇大小不敏感的模糊c均值方法对宏基因组数据集进行无监督分箱

获取原文

摘要

Metagenomics uses random shotgun sequencing technology to sequence the genomes from uncultured microbial communities. Metagenomic dataset is a kind of unbalanced dataset, as it has a characteristic of uneven species abundance ratio. Fuzzy c-means method (FCM) is a famous clustering method but has a poor performance for unbalanced dataset. This paper introduces a cluster size insensitive fuzzy c-means method (csiFCM) to binning of mategenomic datasets with uneven abundance ratio. Experiment results on synthetic datasets illustrate that csiFCM is an effective binning method for metagenomic dataset and has an overall better performance than AbundanceBin and MetaCluster3.0.
机译:Metagenomics使用随机shot弹枪测序技术对未培养微生物群落的基因组进行测序。元基因组数据集是一种不平衡数据集,具有物种丰富度不均匀的特点。模糊c均值方法(FCM)是一种著名的聚类方法,但对于不平衡数据集的性能较差。本文引入了一种簇大小不敏感的模糊c均值方法(csiFCM)来对丰度比不均匀的Mategenomic数据集进行分箱。综合数据集上的实验结果表明,csiFCM是宏基因组数据集的有效分箱方法,其整体性能优于AbundanceBin和MetaCluster3.0。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号