首页> 外文期刊>IAENG Internaitonal journal of computer science >Optimal Data File Allocation for All-to-All Comparison in Distributed System: A Case Study on Genetic Sequence Comparison
【24h】

Optimal Data File Allocation for All-to-All Comparison in Distributed System: A Case Study on Genetic Sequence Comparison

机译:分布式系统全面比较的最佳数据文件分配:遗传序列比较的案例研究

获取原文
           

摘要

In order to solve the problem of unbalanced load of data les in large-scale data all-to-all comparison under distributed system environment, the differences of les themselves arefully considered. This paper aims to fully utilize the advantages of distributed system to enhance the le allocation of all-to-all comparison between the data les in a large dataset. For this purpose, the author formally described the all-to-all comparison problem, and con-structed a data allocation model via mixed integer linear programming (MILP). Meanwhile, a data allocation algorithm was developed on the Matlab using the intlinprog function of branch-and-bound method. Finally, our model and algorithm were veried through several experiments. The results show that the proposed le allocation strategy can achieve the basic load balance of each node in the distributed system without exceeding the storage capacity of any node, and completely localize the data le. The research ndings can be applied to such elds as bioinformatics, biometrics and data mining.
机译:为了解决分布式系统环境下的大规模数据中的大规模数据中数据量的不平衡负荷的问题,因此LES本身的差异齐全地考虑。本文旨在充分利用分布式系统的优势,增强大型数据集中数据LES之间的全面比较的LE分配。为此目的,作者正式描述了全面的比较问题,并通过混合整数线性编程(MILP)来控制数据分配模型。同时,使用分支和绑定方法的Intlinprog函数在Matlab上开发了数据分配算法。最后,我们的模型和算法通过了几个实验。结果表明,建议的LE分配策略可以在分布式系统中实现每个节点的基本负载余量而不超过任何节点的存储容量,并完全本地化数据LE。研究题可以应用于这种抗体作为生物信息学,生物识别和数据挖掘。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号