
RELEVANT DOCUMENT EXTRACTION DEVICE, RELEVANT DOCUMENT EXTRACTION METHOD AND RELEVANT DOCUMENT EXTRACTION PROGRAM


Abstract

In the present invention, documents relevant to a specific topic are suitably extracted from a plurality of documents such as tweets. A relevant document extraction device (10) is provided with the following: a default topic tag storage unit (141) that stores a default topic tag indicating a topic; a document storage unit (100) that stores a plurality of documents; a morpheme analysis unit (110) that divides the documents into morphemes; a topic tag estimation unit (130) that extracts, from the plurality of documents, a document that includes the default topic tag and calculates the frequency of appearance of terms in the extracted document; and a topic ID assigning unit (150) that extracts a document relevant to the topic using information based on the calculated frequency of appearance.
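The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the regex tokenizer standing in for the morpheme analysis unit (110), and the `min_count` threshold are all assumptions made for illustration, since the abstract does not say how the calculated frequencies are turned into a selection rule.

```python
import re
from collections import Counter


def tokenize(text):
    # Hypothetical stand-in for the morpheme analysis unit (110):
    # a plain regex split, not a real morphological analyzer.
    return re.findall(r"#?\w+", text.lower())


def estimate_topic_terms(docs, default_tag, min_count=2):
    # Rough analogue of the topic tag estimation unit (130): pick the
    # documents containing the default topic tag and count the terms
    # that appear in them. min_count is an assumed cutoff.
    counts = Counter()
    for doc in docs:
        tokens = tokenize(doc)
        if default_tag in tokens:
            counts.update(t for t in tokens if t != default_tag)
    return [term for term, n in counts.most_common() if n >= min_count]


def extract_relevant(docs, default_tag, min_count=2):
    # Rough analogue of the topic ID assigning unit (150): keep every
    # document that contains the default tag or a frequent co-occurring term.
    terms = set(estimate_topic_terms(docs, default_tag, min_count))
    terms.add(default_tag)
    return [doc for doc in docs if terms & set(tokenize(doc))]


if __name__ == "__main__":
    docs = [
        "#worldcup kickoff tonight",
        "#worldcup kickoff delayed",
        "kickoff moved to nine",
        "rain expected tomorrow",
    ]
    for doc in extract_relevant(docs, "#worldcup"):
        print(doc)
```

In this toy data the third document carries no tag but is still picked up, because "kickoff" appears in both tagged documents. A practical system would likely also need stop-word filtering or a relative-frequency measure, which this sketch omits.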

Bibliographic data

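  • Publication number: WO2014021229A1
  • Publication date: 2014-02-06
  • Original format: PDF
  • Applicant/patentee: NTT DOCOMO INC.
  • Application number: WO2013JP70376
  • Filing date: 2013-07-26
  • Classification: G06F17/30
  • Country: WO
  • Date added to database: 2022-08-21 15:51:53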
