首页> 外国专利> Textual document analysis using word cloud comparison

Textual document analysis using word cloud comparison

机译:使用词云比较进行文本文档分析

摘要

A system and method textually analyze documents. A frequency distribution is generated for the documents, and an intersection between the documents is determined. For each word in the intersection, the frequency of the word in the first document is compared with the frequency of the word in the second document, and the lower frequency is selected. A similarity measure between the first document and the second document is determined as a function of a count of the words in the intersection, a count of the words in the second document, the selected lower frequencies, and the frequency distribution for the words in the second document.
机译:一种系统和方法以文本方式分析文档。为文档生成频率分布,并确定文档之间的交点。对于相交中的每个单词,将第一文档中单词的频率与第二文档中单词的频率进行比较,并选择较低的频率。确定第一文档和第二文档之间的相似性度量取决于交集中单词的数量,第二文档中单词的数量,所选较低频率以及单词中单词的频率分布第二份文件。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号