
Web page classification program, Web page classification system and Web page classification method


Abstract

PROBLEM TO BE SOLVED: To properly extract advertisement pages so as not to degrade the precision of results obtained by extracting reputation information from Web pages and analyzing it.

SOLUTION: The Web page classification program makes a computer extract advertisement pages, on which articles written by advertisers are posted, from Web pages on the Internet on which articles are posted. The program stores a phrase list in which phrases consisting of unique expressions are registered, extracts phrases from the text information included in the Web pages, counts the number of cases in which phrases in the phrase list match the extracted phrases, and extracts the advertisement pages from the Web pages based on the counts. Because the text of advertisement pages is assumed to contain many phrases consisting of unique expressions, Web pages that contain more such phrases than a set threshold are, for example, extracted as advertisement pages.

COPYRIGHT: (C)2008,JPO&INPIT
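The counting scheme described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the phrase list entries, the n-gram phrase extractor, the function names, and the threshold value are all assumptions made for the example, since the abstract does not specify them. Japanese text would also need a different tokenizer than the simple word split used here.

```python
import re

# Hypothetical phrase list of "unique expressions" (e.g. product and shop names).
# The abstract does not give concrete entries; these are made up for illustration.
PHRASE_LIST = {"acme cola", "acme cola zero", "example mart"}


def extract_phrases(text, max_words=3):
    """Return every run of 1..max_words consecutive words in the text, lowercased.

    Assumption: phrases are approximated by word n-grams.
    """
    words = re.findall(r"\w+", text.lower())
    phrases = []
    for n in range(1, max_words + 1):
        for i in range(len(words) - n + 1):
            phrases.append(" ".join(words[i:i + n]))
    return phrases


def count_matches(text, phrase_list=PHRASE_LIST):
    """Count how many extracted phrases appear in the phrase list."""
    return sum(1 for phrase in extract_phrases(text) if phrase in phrase_list)


def is_advertisement_page(text, threshold=2, phrase_list=PHRASE_LIST):
    """Treat a page as an advertisement if its match count exceeds the threshold."""
    return count_matches(text, phrase_list) > threshold


ad_text = "Acme Cola Zero is here! Buy Acme Cola at Example Mart today."
review_text = "I tried the new drink yesterday and it was too sweet."

print(is_advertisement_page(ad_text))      # match count is above the threshold
print(is_advertisement_page(review_text))  # no listed phrases appear
```

Pages flagged this way could then be excluded before reputation information is extracted, which is the motivation stated in the abstract. The strict "greater than" comparison follows the abstract's wording ("more ... than the set threshold").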

Bibliographic data

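  • Publication number: JP5135701B2

  • Publication date: 2013-02-06

  • Original format: PDF

  • Applicant/Patentee: 富士通株式会社

  • Application number: JP20060094350

  • Inventors: 高橋 哲朗; 内野 寛治

  • Filing date: 2006-03-30

  • Classification: G06F17/30

  • Country: JP

  • Added to database: 2022-08-21 16:53:47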
