首页> 外文期刊>International journal of computers, communications and control >Optimal Data File Allocation for All-to-All Comparison in Distributed System: A Case Study on Genetic Sequence Comparison
【24h】

Optimal Data File Allocation for All-to-All Comparison in Distributed System: A Case Study on Genetic Sequence Comparison

机译:分布式系统中全部比较的最佳数据文件分配:以遗传序列比较为例

获取原文
           

摘要

In order to solve the problem of unbalanced load of data les in large-scale data all-to-all comparison under distributed system environment, the differences of les themselves arefully considered. This paper aims to fully utilize the advantages of distributed system to enhance the le allocation of all-to-all comparison between the data les in a large dataset. For this purpose, the author formally described the all-to-all comparison problem, and con-structed a data allocation model via mixed integer linear programming (MILP). Meanwhile, a data allocation algorithm was developed on the Matlab using the intlinprog function of branch-and-bound method. Finally, our model and algorithm were veried through several experiments. The results show that the proposed le allocation strategy can achieve the basic load balance of each node in the distributed system without exceeding the storage capacity of any node, and completely localize the data le. The research ndings can be applied to such elds as bioinformatics, biometrics and data mining.
机译:为了解决分布式系统环境下大规模数据全部比较中数据文件负载不平衡的问题,充分考虑了文件本身的差异。本文旨在充分利用分布式系统的优势来增强大型数据集中数据文件之间的全部比较的文件分配。为此,作者正式描述了全部比较问题,并通过混合整数线性规划(MILP)构建了数据分配模型。同时,利用分支定界方法的intlinprog函数在Matlab上开发了一种数据分配算法。最后,通过几次实验验证了我们的模型和算法。结果表明,所提出的文件分配策略可以在不超过任何节点的存储容量的情况下,实现分布式系统中每个节点的基本负载均衡,并完全定位数据文件。研究结果可应用于生物信息学,生物识别和数据挖掘等领域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号