首页> 外国专利> Index term extraction apparatus surveyed document, personality expression diagrams, and document feature analyzer

Index term extraction apparatus surveyed document, personality expression diagrams, and document feature analyzer

机译:索引词提取装置调查文件,个性表达图和文件特征分析器

摘要

The first appearance frequency calculating means for calculating the index words and extracting index terms extraction means, the extracted index words surveyed document d in the function value IDF frequency of occurrence in the comparison target document group P (P) is , and similar documents group elected means that based on the data of the survey document d, to elect a similar document set S that is similar to the survey document d from the comparison document group P, and index terms that have been the extraction, similar document group and a second appearance frequency calculating means for calculating function value IDF frequency of appearance in the S a (S), on the combination of the function value of each frequency of occurrence for each index term, in a similar set of documents and comparison group of documents is the calculated Based on, and an output means for outputting the data of the time series variation and its positioning and its respective index words. As a result, it is possible to have accurate understanding of the temporal transition and nature of the survey document without reading the document.
机译:用于计算索引词的第一出现频率计算装置和提取索引词提取装置,在比较目标文档组P(P)中以函数值IDF的出现频率提取所抽取的索引词调查文档d,并选择相似文档组。表示根据调查文件d的数据,从比较文件组P中选择与调查文件d类似的相似文件集S,以及已提取的索引项,相似文件组和第二次出现频率计算装置,用于计算函数值IDF在S a(S)中出现的频率,根据每个索引项的每个出现频率的函数值的组合,计算出一组相似的文档和一组比较文档根据一个输出装置,用于输出时间序列变化及其位置和各自的索引字的数据。结果,可以在不阅读文件的情况下准确地了解调查文件的时间转变和性质。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号