首页> 外国专利> COMPUTERIZED ASSESSMENT OF ARTICLES WITH SIMILAR CONTENT AND HIGHLIGHTING OF DISTINCTIONS THEREBETWEEN

COMPUTERIZED ASSESSMENT OF ARTICLES WITH SIMILAR CONTENT AND HIGHLIGHTING OF DISTINCTIONS THEREBETWEEN

机译:对内容相似的文章进行计算机评估并突出两者之间的区别

摘要

A computer receives a list of reference topics from a topic database and a set of articles related to said reference topics. The computer generates article n-grams and compares them to the reference topics using NLP to determine a primary theme for each article that corresponds to one of reference topics. The computer collects articles with common primary themes into at least one article group and determining an article comparison value between articles in the article group. Responsive to determining that an article comparison value is below a predetermined similarity threshold, determining a distinguishing feature associated with one of the compared articles that contributed to the article comparison value. The computer assigns articles having the distinguishing feature into a secondary group based, at least in part, on the distinguishing feature.
机译:计算机从主题数据库接收参考主题列表和与所述参考主题相关的一组文章。计算机生成文章n图,并使用自然语言处理将其与参考主题进行比较,以确定与其中一个参考主题相对应的每篇文章的主主题。计算机将具有共同主主题的文章收集到至少一个文章组中,并确定文章组中文章之间的文章比较值。响应于确定物品比较值低于预定的相似性阈值,确定与促成物品比较值的所比较物品之一相关联的区别特征。计算机至少部分地基于区别特征将具有区别特征的物品分配到第二组。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号