首页> 外文会议>Pacific Rim international conference on artificial intelligence >Clustering of Small-Sample Single-Cell RNA-Seq Data via Feature Clustering and Selection
【24h】

Clustering of Small-Sample Single-Cell RNA-Seq Data via Feature Clustering and Selection

机译:通过特征聚类和选择聚类小样本单细胞RNA-Seq数据

获取原文

摘要

We present FeatClust, a software tool for clustering small sample size single-cell RNA-Seq datasets. The FeatClust approach is based on feature selection. It divides features into several groups by performing agglomerative hierarchical clustering and then iteratively clustering the samples and removing features belonging to groups with the least variance across samples. The optimal number of feature groups is selected based on silhouette analysis on the clustered data, i.e., selecting the clustering with the highest average silhouette coefficient. FeatClust also allows one to visually choose the number of clusters if it is not known, by generating silhouette plot for a chosen number of groupings of the dataset. We cluster five small sample single-cell RNA-seq datasets and use the adjusted rand index metric to compare the results with other clustering packages. The results are promising and show the effectiveness of FeatClust on small sample size datasets.
机译:我们介绍了FeatClust,这是一种用于对小样本大小的单细胞RNA-Seq数据集进行聚类的软件工具。 FeatClust方法基于功能选择。它通过执行聚集层次聚类,然后迭代地对样本进行聚类,并删除样本中方差最小的组中的特征,将特征分为几组。基于对聚类数据的轮廓分析来选择最佳数量的特征组,即,选择具有最高平均轮廓系数的聚类。 FeatClust还可以通过为数据集的选定数量的分组生成轮廓图,在不知道的情况下直观地选择群集的数目。我们对五个小样本单细胞RNA-seq数据集进行聚类,并使用调整后的rand指标来将结果与其他聚类包进行比较。结果令人鼓舞,并显示了FeatClust在小样本数据集上的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号