首页> 外文会议>International conference on neural information processing >Recent Advances in Improving the Memory Efficiency of the TRIBE MCL Algorithm
【24h】

Recent Advances in Improving the Memory Efficiency of the TRIBE MCL Algorithm

机译:TRIBE MCL算法提高存储效率的最新进展

获取原文

摘要

A fast and highly memory-efficient implementation of the TRIBE-MCL clustering algorithm is proposed to perform the classification of huge protein sequence data sets using an ordinary PC. Improvements compared to previous versions are achieved through adequately chosen data structures that facilitate the efficient handling of symmetric sparse matrices. The proposed algorithm was tested on huge synthetic protein sequence data sets. The validation process revealed that the proposed method extended the data size processable on a regular PC from previously reported 250 thousand to one million items. The algorithm needs 10-20% less time for processing the same data sizes than previous efficient Markov clustering algorithms, without losing anything from the partition quality. The proposed solution is open for further improvement via parallel data processing.
机译:提出了一种快速且高效存储的TRIBE-MCL聚类算法,以使用普通PC对巨大的蛋白质序列数据集进行分类。与先前版本的改进通过有助于对称稀疏矩阵的高效处理适当选择的数据结构来实现的。该算法在庞大的合成蛋白序列数据集上进行了测试。验证过程表明,所提出的方法将可在常规PC上处理的数据大小从先前报告的25万个扩展到了一百万个。与以前的高效Markov聚类算法相比,该算法处理相同数据大小所需的时间减少了10-20%,而不会因分区质量而损失任何东西。所提出的解决方案是开放的,可以通过并行数据处理进行进一步的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号