首页> 外文期刊>Mobile networks & applications >Effective Big Data Retrieval Using Deep Learning Modified Neural Networks
【24h】

Effective Big Data Retrieval Using Deep Learning Modified Neural Networks

机译:使用深度学习修改的神经网络进行有效的大数据检索

获取原文
获取原文并翻译 | 示例
           

摘要

In big data, the information retrieval (IR) embraces the discovery of documents from a compilation of dataset which are related to the user query. Usually, the data retrieval systems are used to scan the data. The existent approaches that employ customary IR are wearisome for big document collections. Recently, IR approaches were developed, albeit these are faster comparing to the traditional method but the quality of the document retrieved is less. To overcome such difficulties, here, effectual big data retrieval utilizing Deep Learning Modified Neural Networks (DLMNN) is proposed. Initially, the general pre-processing along with feature extraction steps are taken place. In preprocessing stage, unwanted data are removed and also used for changing the unstructured data in to structured data then in FE is done using frequency and entropy calculation of the given input data. Secondly, find the closed recurrent item dataset, after that find the weight of provided data using entropy measure and frequent item measure. In the 3rd step, the documents are clustered utilizing the k-means algorithm and then classified using DLMNN. The K-Means algorithm is utilized to partition the collection of documents into several clusters then the DLMNN is used for classifying the documents into positive and negative classes. The proposed DLNN weight parameters are optimized utilizing the cuckoo search (CS) optimization algorithm. The last stage on the training process section is generating a training data-base. In the other part, the retrieval process is performed, in this section it pre-processes the user query and discovers the frequency item set then gets retrieval data. Finally, check the similarity assessment, if the information is found then it is visualized, otherwise the document is returned to the initial position. Experimental results contrasted with the previous MRT and IRI-RAS techniques concerning precision, recall, F-measure along with computation time. The proposed document IR is better when comparing with existent methods.
机译:在大数据中,信息检索(IR)包括从与用户查询相关的数据集汇编中发现文档。通常,数据检索系统用于扫描数据。现有的采用常规IR的方法对于大文件收集来说很烦人。近年来,虽然开发了IR方法,但是与传统方法相比,它们虽然速度更快,但是检索到的文档质量却较低。为了克服这些困难,在这里,提出了使用深度学习修改神经网络(DLMNN)进行有效的大数据检索。首先,进行一般的预处理以及特征提取步骤。在预处理阶段,不需要的数据将被删除,还用于将非结构化数据更改为结构化数据,然后在FE中使用给定输入数据的频率和熵计算来完成。其次,找到封闭的经常性项目数据集,然后使用熵测度和频繁项测度找到所提供数据的权重。第三步,使用k-means算法对文档进行聚类,然后使用DLMNN对其进行分类。利用K-Means算法将文档集合划分为几个群集,然后使用DLMNN将文​​档分类为肯定和否定类别。利用布谷鸟搜索(CS)优化算法对提出的DLNN权重参数进行了优化。培训过程部分的最后一个阶段是生成培训数据库。在另一部分中,执行检索过程,在此部分中,它会预处理用户查询并发现频率项集,然后获取检索数据。最后,检查相似性评估,如果找到了信息,则将其可视化,否则将文档返回到初始位置。实验结果与以前的MRT和IRI-RAS技术在精度,召回率,F度量以及计算时间方面形成了对比。与现有方法相比,建议的文档IR更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号