首页> 外国专利> TEXT INFORMATION CLUSTERING METHOD AND TEXT INFORMATION CLUSTERING SYSTEM

TEXT INFORMATION CLUSTERING METHOD AND TEXT INFORMATION CLUSTERING SYSTEM

机译:文本信息聚类方法和文本信息聚类系统

摘要

Embodiments of the disclosure provide a text information clustering method and a text information clustering system. The method can include performing word segmentation on multiple pieces of text information to generate multiple words; performing an initial clustering on the multiple words to generate multiple first-level topics, each of the first-level topics comprising at least two pieces of text information; determining, for each of the first-level topics, a number of second-level topics based on a number of pieces of text information under the first-level topic; and performing, according to the number of second-level topics of each of the first-level topics, a secondary clustering on the multiple words of at least two pieces of text information comprised in the first-level topic to generate multiple second-level topics.
机译:本公开的实施例提供了一种文本信息聚类方法和文本信息聚类系统。该方法可以包括对多条文本信息执行单词分割以生成多个单词;对多个单词进行初始聚类,以生成多个第一级主题,每个第一级主题包括至少两条文本信息;根据第一级主题下的多个文本信息,为每个第一级主题确定多个第二级主题;根据每个第一级主题的第二级主题数,对第一级主题中包含的至少两个文本信息的多个词进行二次聚类,生成多个第二级主题。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号