The knowledge data gathering system, in the knowledge data gathering system which

Abstract

PROBLEM TO BE SOLVED: To collect only information that is useful to users.

SOLUTION: A starting-point URL, the number of link levels to follow when collecting information, and keywords representing unnecessary words are set in a setting file 13. A link label extraction module 113 extracts link character strings from page information collected from a network (the Internet or an intranet) 20. A link determination module 114 uses the extracted link character strings and the configured unnecessary-word keywords to determine whether the page information at a link destination is useless. A collection control module 111 controls the collection of information from the network 20 by following links from the starting-point URL. The collection control module 111 does not collect the page information of a link destination that the link determination module 114 has determined to be useless, even if that destination lies within the configured number of link levels.

COPYRIGHT: (C)2006, JPO&NCIPI
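The collection flow described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the names `LinkLabelExtractor`, `is_useless`, and `collect` are hypothetical, the substring match against link text is an assumed way of applying the unnecessary-word keywords, and the `fetch` callable stands in for whatever retrieves pages from the network.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkLabelExtractor(HTMLParser):
    """Collects (href, link text) pairs; plays the role of the link label extraction module."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def is_useless(label, unnecessary_keywords):
    """Assumed rule: a link is useless if its label contains any unnecessary keyword."""
    lowered = label.lower()
    return any(keyword.lower() in lowered for keyword in unnecessary_keywords)


def collect(start_url, max_depth, unnecessary_keywords, fetch):
    """Breadth-first collection from start_url, following links up to max_depth levels.

    fetch is a hypothetical callable mapping a URL to its HTML, or None if unavailable.
    """
    collected = {}
    seen = {start_url}
    queue = deque([(start_url, 0)])
    while queue:
        url, depth = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        collected[url] = html
        if depth >= max_depth:
            continue  # the configured number of link levels has been reached
        parser = LinkLabelExtractor()
        parser.feed(html)
        for href, label in parser.links:
            target = urljoin(url, href)
            # Skip useless destinations even though they are within max_depth.
            if target in seen or is_useless(label, unnecessary_keywords):
                continue
            seen.add(target)
            queue.append((target, depth + 1))
    return collected
```

Passing `fetch` as a parameter keeps the sketch independent of any particular HTTP client; for a quick trial, a dictionary's `get` method mapping URLs to HTML strings can serve as `fetch`.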