首页> 外国专利> EXTRACTION OF ANCHOR EXPLANATORY TEXT BY MINING REPEATED PATTERNS

EXTRACTION OF ANCHOR EXPLANATORY TEXT BY MINING REPEATED PATTERNS

机译:挖掘重复模式提取锚定解释性文本

摘要

A method and system for identifying explanatory text for a referenced web page based on a reference to the referenced web page contained in a repeated pattern of a referencing web page is provided. An anchor explanatory text (“AET”) system uses the hierarchical organization of the web page to identify a repeated pattern of hierarchical elements that contain references to other display pages. After the AET system identifies a repeated pattern, it identifies the dominant reference or anchor within each occurrence of the pattern. The AET system uses the explanatory text surrounding a dominant anchor as a description of the referenced web page.
机译:提供了一种用于基于对参考网页的重复模式中包含的对参考网页的参考来识别参考网页的说明文本的方法和系统。锚定说明文本(“ AET”)系统使用网页的层次结构来标识包含对其他显示页面的引用的层次元素的重复模式。在AET系统识别出重复的模式后,它将识别出每次出现模式时的主要参考或锚点。 AET系统使用围绕主要锚点的解释性文本作为参考网页的描述。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号