...
首页> 外文期刊>Cluster computing >SmallClient for big data: an indexing framework towards fast data retrieval
【24h】

SmallClient for big data: an indexing framework towards fast data retrieval

机译:大数据的小节:快速数据检索的索引框架

获取原文
获取原文并翻译 | 示例
           

摘要

Numerous applications are continuously generating massive amount of data and it has become critical to extract useful information while maintaining acceptable computing performance. The objective of this work is to design an indexing framework which minimizes indexing overhead and improves query execution and data search performance with optimum aggregation of computing performance. We propose SmallClient, an indexing framework to speed up query execution. SmallClient has three modules: block creation, index creation and query execution. Block creation module supports improving data retrieval performance with minimum data uploading overhead. Index creation module allows maximum indexes on a dataset to increase index hit ratio with minimized indexing overhead. Finally, query execution module offers incoming queries to utilize these indexes. The evaluation shows that SmallClient outperforms Hadoop full scan with more than 90% search performance. Meanwhile, indexing overhead of SmallClient is reduced to approximately 50 and 80% for index size and indexing time respectively.
机译:许多应用程序正在连续地产生大量数据,并且在保持可接受的计算性能的同时提取有用信息变得至关重要。这项工作的目的是设计一个索引框架,最大限度地减少索引开销,并通过最佳聚集来改进查询执行和数据搜索性能。我们提出SmallClient,一个索引框架来加快查询执行。 SmallClient有三个模块:块创建,索引创建和查询执行。 Block Creation模块支持使用最小数据上传开销,从而提高数据检索性能。索引创建模块允许在数据集上最大索引,以提高索引命中率,最小化索引开销。最后,查询执行模块提供要使用这些索引的传入查询。评估表明,小型线装优于Hadoop全面扫描,搜索性能超过90%。同时,对于索引尺寸和索引时间,小型电闸的索引开销减少到大约50%和80%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号