首页> 外文会议>International Conference on Cloud Computing and Big Data Analytics >Web Service Clustering Method Based on Word Vector and Biterm Topic Model
【24h】

Web Service Clustering Method Based on Word Vector and Biterm Topic Model

机译:基于Word Vector和Biterm主题模型的Web服务聚类方法

获取原文

摘要

The increasing number of Web services brings challenges for accurate and efficient service discovery. Service clustering technology can narrow the retrieval range of service discovery and improve the efficiency of service discovery. However, due to the short text features with few words and difficult feature extraction in service description text, the traditional text clustering model can not get effective clusters. In this paper, a web service clustering method based on word vector and BTM (Biterm Topic Model) is proposed. Firstly, the sparse feature problem of short text is alleviated by expanding the word vector. Secondly, the extended service description text is modeled by the BTM based on Gibbs sampling. Finally, the k-means algorithm is used to cluster web services. Compared with LDA (Latent Dirichlet Allocation) and BTM, this method improves the F-Measure of clustering index. Experimental results show that the proposed method can effectively alleviate the problem of sparse service description text features and improve the clustering effect of services.
机译:越来越多的Web服务为准确和高效的服务发现带来了挑战。服务聚类技术可以缩小服务发现的检索范围,提高服务发现效率。但是,由于服务描述文本中具有几个单词和困难的特征提取的短文本功能,传统的文本聚类模型无法获得有效的群集。本文提出了一种基于Word Vector和BTM(BITERM主题模型)的Web服务聚类方法。首先,通过扩展单词向量来缓解短文本的稀疏功能问题。其次,扩展服务描述文本由基于GIBBS采样的BTM建模。最后,K-means算法用于群集Web服务。与LDA(潜在Dirichlet分配)和BTM相比,该方法改善了聚类指数的F测量。实验结果表明,该方法可以有效缓解稀疏服务描述文本特征的问题,提高服务的聚类效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号