首页> 外国专利> Method for clustering nodes of a textual network taking into account textual content, computer-readable storage device and system implementing said method

Method for clustering nodes of a textual network taking into account textual content, computer-readable storage device and system implementing said method

机译:考虑文本内容对文本网络的节点进行聚类的方法,计算机可读存储设备和实现所述方法的系统

摘要

The invention relates to a method for clustering nodes of a network, the network comprising nodes associated with message edges of text data, the method comprising an initialization step of determination of a first initial clustering of the nodes, and a step of iterative inference of a generative model of text documents. Edges are modeled with a Stochastic Block Model (SBM) and the sets of documents between and within clusters are modeled according to a generative model of documents. The inference step comprises iteratively modelling the text documents and the underlying topics of their textual content, and updating the clustering as a function of the modelling, until a convergence criterion is fulfilled and an optimized clustering and corresponding optimized values of the parameters of the models are output.
机译:本发明涉及一种用于对网络的节点进行聚类的方法,该网络包括与文本数据的消息边缘相关联的节点,该方法包括确定节点的第一初始聚类的初始化步骤,以及对网络的迭代推断的步骤。文本文档生成模型。使用随机块模型(SBM)对边进行建模,并根据文档的生成模型对聚类之间和内部的文档集进行建模。推理步骤包括对文本文档及其文本内容的基础主题进行迭代建模,并根据建模功能更新聚类,直到满足收敛标准并且模型的参数的优化聚类和相应的优化值得到满足输出。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号