Research on Deep Web Query Interface Clustering Based on Hadoop

Baohua Qiang1; Rui Zhang2; Yufeng Wang3; Qian He1; Wei Li1; Sai Wang1

首页> 外文期刊>Journal of software >Research on Deep Web Query Interface Clustering Based on Hadoop

【24h】

Research on Deep Web Query Interface Clustering Based on Hadoop

机译：基于Hadoop的深网络查询界面聚类研究

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

How to cluster different query interfaceseffectively is one of the most core issues when generatingintegrated query interface on Deep Web integration domain.However, with the rapid development of Internet technology,the number of Deep Web query interface shows an explosivegrowth trend. For this reason, the traditional stand-aloneDeep Web query interface clustering approaches encounterbottlenecks in terms of time complexity and spacecomplexity. After further study of the Hadoop distributedplatforms and Map Reduce programming model, a DeepWeb query interface clustering algorithm based on Hadoopplatform is designed and implemented, in which the VectorSpace Model (VSM) and Latent Semantic Analysis (LSA)are employed to represent “Query Interfaces-Attributes”relationships. The experimental results show that theproposed algorithm has better scalability and speedup ratioby using Hadoop architecture.

机译：如何群集不同的查询interfaceffective是在深网络集成域生成interograted查询接口时最核心问题之一。然而，随着互联网技术的快速发展，深网络查询界面的数量显示了爆炸性的趋势。因此，在时间复杂度和间歇分解性方面，传统的展台网络查询界面群集接近EncounterBottLenecks。在进一步研究HADOOP分配表和地图缩小编程模型之后，设计并实现了一种基于HACoopPlatform的DeepWeb查询界面聚类算法，其中使用Vectorspace模型（VSM）和潜在语义分析（LSA）来表示“查询接口 - 属性“关系”。实验结果表明，有关算法使用Hadoop架构具有更好的可扩展性和加速Ratioby。

著录项

来源
《Journal of software》 |2014年第12期|共6页
作者
Baohua Qiang1; Rui Zhang2; Yufeng Wang3; Qian He1; Wei Li1; Sai Wang1;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Research on Deep Web Query Interface Clustering Based on Hadoop [J] . Baohua Qiang, Rui Zhang, Yufeng Wang, Journal of software . 2014,第12期

机译：基于Hadoop的深度Web查询接口聚类研究。
2. Research on Deep Web Query Interface Clustering Based on Hadoop [J] . Baohua Qiang, Rui Zhang, Yufeng Wang, Journal of Computers . 2014,第12期

机译：基于Hadoop的深网络查询界面聚类研究
3. Multi-objective optimization integration of query interfaces for the Deep Web based on attribute constraints [J] . Yanni Li, Yuping Wang, Peng Jiang, Data & Knowledge Engineering . 2013,第jula期

机译：基于属性约束的Deep Web查询接口的多目标优化集成
4. An interactive clustering-based approach to integrating source query interfaces on the deep Web [C] . Wensheng Wu, Clement Yu, AnHai Doan, ACM SIGMOD international conference on Management of data . 2004

机译：基于交互式群集的方法，用于在深度Web上集成源查询接口
5. Data intensive query processing for Semantic Web data using Hadoop and MapReduce. [D] . Husain, Mohammad Farhan. 2011

机译：使用Hadoop和MapReduce对语义Web数据进行数据密集型查询处理。
6. GExplore 1.4: An expanded web interface for queries on Caenorhabditis elegans protein and gene function [O] . Harald Hutter, Jinkyo Suh 2016

机译：GExplore 1.4：一个扩展的Web界面用于查询秀丽隐杆线虫蛋白质和基因功能
7. Research on Deep Web Query Interface Clustering Based on Hadoop [O] . Baohua Qiang, Rui Zhang, Yufeng Wang, 2015

机译：基于Hadoop的Deep Web查询接口聚类研究
8. UMass at TREC WEB 2014: Entity Query Feature Expansion using Knowledge Base Links. [R] . Dietz, L., Verga, P. 2014

机译：TREC WEB 2014的Umass：使用知识库链接进行实体查询功能扩展。

Research on Deep Web Query Interface Clustering Based on Hadoop

摘要

著录项

相似文献

相关主题

期刊订阅