
Time-series document aggregation method using a central keyword as the axis


Abstract

PROBLEM TO BE SOLVED: To collect documents that deal with the same subject matter, targeting plain, freely written documents such as diaries and personal reflections.

SOLUTION: A time-series document grouping device analyzes the input documents and acquires the general words (the words appearing in each document) together with their positions of appearance. It also acquires the keywords that are candidates for a main keyword, together with their positions of appearance in the input documents. The device tallies the keywords, selects the frequent ones, and divides the documents containing a selected keyword into subsets. For each subset, it determines whether the documents describe the same topic, based on a tally of the general words that appear within a predetermined range around the keyword in those documents. It then integrates the determination results across the subsets and finally determines whether the keyword is the center of a topic.

COPYRIGHT: (C)2007, JPO&INPIT
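The steps in the abstract can be sketched in Python. This is only an illustration under stated assumptions, not the patented method: the abstract does not say how documents are tokenized, how subsets are formed, what the "same topic" test is, or how subset results are integrated. Here documents are assumed to be whitespace-separated text in time order, subsets are consecutive chunks of `subset_size` documents, a subset counts as one topic when some context word recurs in at least `min_share` of its documents, and integration is a simple majority vote. All function and parameter names are hypothetical.

```python
from collections import Counter


def context_words(tokens, keyword, window):
    """Set of general words within `window` tokens of any occurrence of `keyword`."""
    found = set()
    for i, tok in enumerate(tokens):
        if tok == keyword:
            lo = max(0, i - window)
            found.update(t for t in tokens[lo:i + window + 1] if t != keyword)
    return found


def subset_shares_topic(subset, keyword, window, min_share):
    """Assumed rule: one context word must recur in >= min_share of the subset's documents."""
    totals = Counter()
    for tokens in subset:
        totals.update(context_words(tokens, keyword, window))
    return bool(totals) and max(totals.values()) / len(subset) >= min_share


def central_keywords(docs, candidates, min_docs=2, subset_size=2, window=2, min_share=1.0):
    """docs: texts in time order; candidates: set of candidate main keywords."""
    tokenized = [d.lower().split() for d in docs]

    # Tally, per candidate keyword, the number of documents that contain it.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens) & candidates)

    central = []
    for kw in sorted(doc_freq):
        if doc_freq[kw] < min_docs:
            continue  # not a frequent keyword
        hits = [t for t in tokenized if kw in t]
        # Assumed subset rule: consecutive chunks of the time-ordered hits.
        subsets = [hits[i:i + subset_size] for i in range(0, len(hits), subset_size)]
        votes = [subset_shares_topic(s, kw, window, min_share) for s in subsets]
        # Assumed integration rule: strict majority of subsets must agree.
        if sum(votes) * 2 > len(votes):
            central.append(kw)
    return central
```

A call might look like `central_keywords(diary_entries, {"puppy", "tea"}, window=1)`, where `diary_entries` is a hypothetical list of texts. With `min_share=1.0`, a subset that holds a single document passes as long as the keyword has any neighbouring word, so a real system would likely need a stricter rule for small subsets.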

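Bibliographic data

  • Publication number: JP4595692B2

  • Publication date: 2010-12-08

  • Original format: PDF

  • Applicant/patentee: 日本電信電話株式会社

  • Application number: JP20050175538

  • Inventors: 佐藤 吉秀; 大久保 雅且

  • Filing date: 2005-06-15

  • Classification: G06F17/30

  • Country: JP

  • Record added: 2022-08-21 18:16:48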
