首页> 外国专利> DOCUMENT FEATURE EXTRACTION DEVICE, DOCUMENT FEATURE EXTRACTION METHOD, AND DOCUMENT FEATURE EXTRACTION PROGRAM

DOCUMENT FEATURE EXTRACTION DEVICE, DOCUMENT FEATURE EXTRACTION METHOD, AND DOCUMENT FEATURE EXTRACTION PROGRAM

机译:文档特征提取设备,文档特征提取方法和文档特征提取程序

摘要

PROBLEM TO BE SOLVED: To appropriately extract a feature corresponding to a browser's browsing intention using a reference relationship between structured documents.;SOLUTION: A browsing history recording unit 2 of a document feature extraction device 1 records a browsing history of each browser in a browsing history set DB 3. A feature extraction unit 4 extracts a link and related text of the link from a structured document as a link source included in the browsing history in the DB 3. Words are extracted from body text as a representative portion in a structured document as a link destination including the extracted information. A feature recalculation unit 5 calculates weighting for the extracted words. An output unit 6 outputs the extracted words in a priority order corresponding to the weighting.;COPYRIGHT: (C)2013,JPO&INPIT
机译:解决的问题:使用结构化文档之间的参考关系来适当地提取与浏览器的浏览意图相对应的特征。解决方案:文档特征提取装置1的浏览历史记录单元2在浏览中记录每个浏览器的浏览历史。历史集DB3。特征提取单元4从结构化文档中提取链接和链接的相关文本,作为包含在DB 3中的浏览历史中的链接源。从正文中提取单词作为结构化中的代表部分。文档作为包含提取信息的链接目标。特征重新计算单元5计算所提取的单词的权重。输出单元6以与加权相对应的优先级顺序输出所提取的单词。COPYRIGHT:(C)2013,JPO&INPIT

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号