首页> 外文期刊>Journal of software >Research on Deep Web Query Interface Clustering Based on Hadoop
【24h】

Research on Deep Web Query Interface Clustering Based on Hadoop

机译:基于Hadoop的深网络查询界面聚类研究

获取原文
           

摘要

How to cluster different query interfaceseffectively is one of the most core issues when generatingintegrated query interface on Deep Web integration domain.However, with the rapid development of Internet technology,the number of Deep Web query interface shows an explosivegrowth trend. For this reason, the traditional stand-aloneDeep Web query interface clustering approaches encounterbottlenecks in terms of time complexity and spacecomplexity. After further study of the Hadoop distributedplatforms and Map Reduce programming model, a DeepWeb query interface clustering algorithm based on Hadoopplatform is designed and implemented, in which the VectorSpace Model (VSM) and Latent Semantic Analysis (LSA)are employed to represent “Query Interfaces-Attributes”relationships. The experimental results show that theproposed algorithm has better scalability and speedup ratioby using Hadoop architecture.
机译:如何群集不同的查询interfaceffective是在深网络集成域生成interograted查询接口时最核心问题之一。然而,随着互联网技术的快速发展,深网络查询界面的数量显示了爆炸性的趋势。因此,在时间复杂度和间歇分解性方面,传统的展台网络查询界面群集接近EncounterBottLenecks。在进一步研究HADOOP分配表和地图缩小编程模型之后,设计并实现了一种基于HACoopPlatform的DeepWeb查询界面聚类算法,其中使用Vectorspace模型(VSM)和潜在语义分析(LSA)来表示“查询接口 - 属性“关系”。实验结果表明,有关算法使用Hadoop架构具有更好的可扩展性和加速Ratioby。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号