Method for segmenting webpages by parsing webpages into document object modules (DOMs) and creating weighted graphs

Abstract

A method of segmenting a webpage into visually and semantically cohesive pieces poses the segmentation as an optimization problem on a weighted graph. The weights reflect whether two nodes in the webpage's DOM tree should be placed together or apart in the segmentation, and they are informed by manually labeled data.
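
To make the idea of pairwise "together or apart" weights concrete, here is a minimal Python sketch. It is not the patented method: the abstract does not specify the optimization procedure, so this uses a simple greedy merging heuristic as a stand-in. The function name `segment`, the node names, and the hand-written weights are all hypothetical; in the abstract's setting the weights would instead be informed by manually labeled data.

```python
from itertools import combinations


def segment(nodes, weight):
    """Greedy merging heuristic (an assumption, not the patent's algorithm).

    weight(a, b) > 0 means nodes a and b should be placed together,
    weight(a, b) < 0 means they should be placed apart.
    """
    segments = [{n} for n in nodes]  # start with one segment per DOM node
    while True:
        best_gain, best_pair = 0.0, None
        for i, j in combinations(range(len(segments)), 2):
            # Total weight across the two segments: the gain from merging them.
            gain = sum(weight(a, b) for a in segments[i] for b in segments[j])
            if gain > best_gain:
                best_gain, best_pair = gain, (i, j)
        if best_pair is None:  # no merge has positive gain, so stop
            return segments
        i, j = best_pair
        segments[i] |= segments[j]
        del segments[j]


# Hypothetical DOM nodes and hand-written weights, for illustration only.
nodes = ["nav1", "nav2", "body1", "body2"]
together = {frozenset(("nav1", "nav2")): 2.0, frozenset(("body1", "body2")): 1.5}


def toy_weight(a, b):
    # Pairs not listed above are assumed to prefer separate segments.
    return together.get(frozenset((a, b)), -1.0)


result = segment(nodes, toy_weight)
```

With these toy weights, the two navigation nodes end up in one segment and the two body nodes in another, because merging across the groups would only add negative weight.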