首页> 外文会议>Distributed Computing Systems Workshops, 2009. ICDCS Workshops '09 >An Index Clustering and Mapping Algorithm for Large Scale Astronomical Data Searching
【24h】

An Index Clustering and Mapping Algorithm for Large Scale Astronomical Data Searching

机译:大规模天文数据搜索的索引聚类和映射算法

获取原文
获取外文期刊封面目录资料

摘要

For large scale unstructured astronomical data documents, the simple index method often results in high communication cost and slow query processing. Based on the characteristics of domain specific astronomical data and the quantitative tracing and analyzing results, a query terms similarity calculation formula is provided. An index clustering algorithm is designed to generate many small clusters with high term association and small real index size which can be stored into different nodes as a whole. To keep high query locality and reasonable load balancing, a practical index mapping algorithm is proposed to map different logical index clusters onto physical nodes. The simulation results show that the algorithms provided in this paper have good scalability for large scale astronomical data index system. Compared with other methods, different queries can be distributed and located onto smaller number of nodes, so communication cost among different nodes can be reduced significantly and the search efficiency could be well improved.
机译:对于大规模的非结构化天文数据文档,简单的索引方法通常会导致较高的通信成本和较慢的查询处理。根据特定领域的天文数据的特点以及定量跟踪和分析结果,提供了一个查询词相似度计算公式。索引聚类算法被设计为生成许多具有高关联性和较小实际索引大小的小聚类,这些聚类可以作为一个整体存储到不同的节点中。为了保持较高的查询局部性和合理的负载均衡,提出了一种实用的索引映射算法,将不同的逻辑索引簇映射到物理节点上。仿真结果表明,本文提出的算法在大规模天文数据索引系统中具有良好的可扩展性。与其他方法相比,可以将不同的查询分布并定位到较少的节点上,从而可以显着降低不同节点之间的通信成本,并可以很好地提高搜索效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号