基于MapReduce的海量文件检索方法研究

谭黔林; 莫春娟

首页> 中文期刊> 《河池学院学报》 >基于MapReduce的海量文件检索方法研究

基于MapReduce的海量文件检索方法研究

AI论文写作 >>

AI期刊论文写作 >>

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In the document retrieval method, the key is built on the database search. However, when the a-mount of data to be retrieved becomes very large, using this search method, a large number of retrieval operations will be concentrated on a single host, which can result in reduced efficiency of retrieval. Under this background, a distributed system can be used to solve the problem. Retrieving resources in a distributed system can be based on MapReduce architecture to achieve retrieval. Thus, the pressure of retrieval operation will be allocated to each node in a distributed system, which can effectively reduce the pressure of the machine and greatly improve the retrieval efficiency. Using the traditional way, retrieving 1 million data consumes 500 seconds, while using the method based on MapReduce architecture for distributed systems to retrieve one million data only needs 40 seconds. Com-pared with traditional search method, method of distributed systems based on MapReduce architecture can promote efficiency to 12. 5 times.%在文件检索的方法中,目前主要是基于数据库进行检索。但是,当待检索的数据量变得非常大的时候,再使用这种检索方式,大量的检索操作就会集中在一台主机上进行,这会导致检索效率降低。基于这种情况,拟采用分布式系统来解决这个问题。在分布式系统中进行资源检索时,可以基于MapReduce架构来实现检索,这样,检索操作的压力将分散到分布式系统的各个节点中,这样可以有效降低机器的压力,大大提高检索的效率。采用传统方式检索100万条数据,需要耗时500 s,而采用基于MapReduce架构的分布式系统的方法来检索100万的数据,只需要花费40 s,相对于传统检索方法采用基于MapReduce架构的分布式系统检索可使检索效率提升接近12．5倍。

著录项

来源
《河池学院学报》 |2016年第2期|101-105|共5页
作者
谭黔林; 莫春娟;
展开▼
作者单位

河池学院计算机与信息工程学院;

广西宜州 546300;

河池学院计算机与信息工程学院;

广西宜州 546300;

展开▼
原文格式 PDF
正文语种 chi
中图分类程序设计、软件工程;
关键词
大数据; MapReduce; 检索; 分布式系统;

相似文献

中文文献
外文文献
专利

1. 基于MapReduce的海量图像检索技术研究 [J] . 朱莹芳 . 长沙民政职业技术学院学报 . 2016,第1期
2. 基于Mapreduce与关联分类挖掘的海量数据分类增量挖掘方法研究 [J] . 何波 . 福建电脑 . 2017,第4期
3. 基于MapReduce技术的海量文本数据统计方法研究 [J] . 宗峰 . 山东英才学院学报 . 2017,第4期
4. 一种基于哈希散列技术进行文件对象存储和检索的方法——海量文件系统数据访问和检索性能加速研究 [J] . 冷迪 . 中国新通信 . 2018,第23期
5. 基于EHDFS的海量小文件存储与检索方法 [J] . 李文武 ,张建锋 ,王景林 . 计算机工程与设计 . 2022,第2期
6. 基于Web的海量视频证据资料存储检索系统开发方法研究 [C] . 李堂恺 ,李志强 ,陈路 . 第八届全国大学生创新创业年会 . 2015
7. 基于MapReduce的海量图像检索技术研究 [A] . 陈广钊 . 2012

基于MapReduce的海量文件检索方法研究

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅