首页> 外国专利> Method and server for extracting topic and evaluating suitability of the extracted topic

Method and server for extracting topic and evaluating suitability of the extracted topic

机译:提取主题并评估所提取主题的适用性的方法和服务器

摘要

A method and a server for extracting a topic and evaluating suitability of the extracted topic are disclosed. The topic extraction server includes a text preprocessing unit configured to extract noun from a document group and remove stopword from the extracted noun, a keyword extraction unit configured to calculate a weight of a noun and extracting a keyword representing the document group, a seed selection unit configured to calculate a weight of the extracted keyword and select a seed, an initial clustering unit configured to generate one cluster including the selected seed and a keyword shown by several times in a sentence including the selected seed, and a cluster combination unit configured to extract a topic group.
机译:公开了一种用于提取主题并评估所提取的主题的适合性的方法和服务器。主题提取服务器包括:文本预处理单元,被配置为从文档组中提取名词并从所提取的名词中去除停用词;关键词提取单元,被配置为计算名词的权重并提取代表文档组的关键字;种子选择单元配置为计算所提取的关键词的权重并选择种子,初始聚类单元,用于生成包括选择的种子和在包含选择的种子的句子中多次示出的关键词的一个聚类,以及配置为提取的聚类组合单元一个主题组。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号