An Efficient Keyword Based Search of Big Data Using Map Reduce

P. Srinivasa Rao; M. H. M. Krishna Prasad; K. Thammi Reddy

首页> 外文期刊>Journal of Advances in Information Technology >An Efficient Keyword Based Search of Big Data Using Map Reduce

【24h】

An Efficient Keyword Based Search of Big Data Using Map Reduce

机译：基于基于关键字的基于关键字使用地图减少的大数据搜索

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

With the arrival of the data deluge, traditional and centralized tools used to extract knowledge from data become obsolete due to their limited ability to handle massive data. To cope with the need for scalable solutions, a new framework has emerged: Hadoop, an open-source ecosystem designed for storage and large-scale processing work on a cluster of commodity hardware. In order to overcome the limitations in key word based information retrieval systems, an efficient methodology has been designed. A system with the new approach mimics the real world, where every task is laced with certain indexing as this is basic idea behind knowledge processing. Hadoop and R: open source frame works for storing and processing large datasets, are used for preprocessing the text documents. First, a set of text documents are considered. Preprocessing is performed on a large domain of data using R. This includes the removal of the stop words along with stemming and excluding less frequency words. Despite this preprocessing, owing to the colossal number of index terms still floating in the considered domain data, the problem of high dimensionality is encountered. Therefore the dimensionality of such a group of terms is reduced by incorporating a keyword based methodology in Hadoop MapReduce Framework. The developed Model is useful for processing the query which gives us the relevant information with low response time from the data pool considered.

机译：随着数据策划的到来，由于其处理大规模数据的能力有限，用于从数据中提取来自数据的知识的传统和集中式工具已经过时。为了应对可扩展解决方案的需求，出现了一个新的框架：Hadoop，这是一个用于存储和大规模加工在商品硬件集群上的开源生态系统。为了克服基于关键词的信息检索系统中的限制，设计了有效的方法。一个具有新方法的系统模仿现实世界，每个任务都会使用某些索引，因为这是知识处理背后的基本思想。 Hadoop和R：开源帧用于存储和处理大型数据集的工作，用于预处理文本文档。首先，考虑一组文本文档。在使用R的大型数据域上执行预处理。这包括去除停止单词以及诸如诸多频率的单词。尽管这种预处理，但由于仍然浮现在所考虑的域数据中的指数术语的巨大数量，遇到了高维度的问题。因此，通过在Hadoop MakReduce框架中包含基于关键字的方法，减少了这类术语的维度的维度。开发的模型对于处理查询是有用的，这向我们提供了从所考虑的数据池中具有低响应时间的相关信息。

著录项

来源
《Journal of Advances in Information Technology》 |2017年第4期|共6页
作者
P. Srinivasa Rao; M. H. M. Krishna Prasad; K. Thammi Reddy;
展开▼
作者单位

Department of CSE MVGRCE Vizianagaram;

Department of CSE JNTUK Kakinada;

Department of CSE GITAM University Visakhapatnam;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计量学;
关键词
Hadoop; MapReduce; Bigdata; HDFS; information retrieval systems;

机译：Hadoop;Mapreduce;BigData;HDFS;信息检索系统;

相似文献

外文文献
中文文献
专利

1. An Efficient Keyword Based Search of Big Data Using Map Reduce [J] . P. Srinivasa Rao, M. H. M. Krishna Prasad, K. Thammi Reddy Journal of Advances in Information Technology . 2017,第3a4期

机译：基于基于关键字的基于关键字使用地图减少的大数据搜索
2. An extended chaotic maps-based keyword search scheme over encrypted data resist outside and inside keyword guessing attacks in cloud storage services [J] . Li Chun-Ta, Lee Chin-Wen, Shen Jau-Ji Nonlinear dynamics . 2015,第3期

机译：在加密数据上扩展的基于混沌地图的关键字搜索方案可抵御云存储服务中的外部和内部关键字猜测攻击
3. An Efficient Relational Database Keyword Search Scheme Based on Combined Candidate Network Evaluation [J] . Ding Guohui, Sun Haohan, Li Jiajia, Quality Control, Transactions . 2020,第期

机译：基于组合候选网络评估的高效关系数据库关键字搜索方案
4. Distributed SLCA-Based XML Keyword Search by Map-Reduce [C] . Chenjing Zhang, Qiang Ma, Xiaoling Wang, DASFAA 2010;International conference on database systems for advances applications;International workshop on graph data management: Techniques and application;GDM 2010;International workshop on benchmarking of database management systems and data-oriented web technologies;BenchmarX 2010;International workshop on managing data quality in collaborative information systems;MCIS 2010;Workshop on social networks and social media mining on the web;SNSMW 2010;Data-intensive eScience workshop;DIEW 2010;International workshop on ubiquitous data management;UDM 2010 . 2010

机译：通过Map-Reduce分布式基于SLCA的XML关键字搜索
5. Efficient data management and keyword-based association discovery on graph data of large scale. [D] . Zhou, Mo. 2014

机译：大规模图形数据的高效数据管理和基于关键字的关联发现。
6. Hidden Policy Attribute-Based Data Sharing with Direct Revocation and Keyword Search in Cloud Computing [O] . Axin Wu, Dong Zheng, Yinghui Zhang, 2018

机译：云计算中具有基于直接策略和关键字搜索的基于隐藏策略属性的数据共享
7. An Efficient Keyword Based Search of Big Data Using Map Reduce [O] . P. Srinivasa Rao, M. H. M. Krishna Prasad, K. Thammi Reddy 2017

机译：基于基于关键字的基于关键字使用地图减少的大数据搜索

An Efficient Keyword Based Search of Big Data Using Map Reduce

摘要

著录项

相似文献

相关主题

期刊订阅